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Technical Fieid 

Th present invention is directed to a niethod for producing humanized nrK>nodonal antit>odies by utilizing 
comparative model buOding to construct the humanized antitx)dy from honK>logous regions of human proteins 
5 by rational design. Specific humanized monoclonal antibodies are prepared. 

Background of the Invention 

Murine derived nwnodonal antit>odies have been utilized as diagnostic and therapeutic agents for numer- 

10 ous human pathdogic conditions including acute inflammatory responses associated with nunierous diseases. 
Administration of murine derived monoclonal antilxxiies (mAbs) as therapeutic agents in n^n has been se- 
verely limited by the development of antibody within the redpient to the mouse antigens of the murine derived 
monoclonal antibody. In attempts to circunrrvent this outcome mAbs have been restructured by recombinant 
DNA technology in such a way as to decrease their immunogenicity in humans. Immunoglobulins are well de- 

15 fined both chemically and biologically with the general structures illustrated in Molecular Cell Biology. Darnell, 
Lodish and Baltimore, Eds.Scientific American Book. Inc. W.H. Freeman, New York, NY (1986). Initially, this 
involved the construction of chimeric antit>odies, Morrison et aL Proc. Nati. Acad. Sd. USA 81: 6851-6855 
(1984). Recombinant technology was employed to replace the murine heavy and light chain constant regions 
with corresponding hurrian constant regions. Upon expression, such interspecies antibody chimeras yielded 

20 molecules with the ant^en binding specif k:fties of the parent nnurine antibody. The following references gerv 
erally describe chinr^ric antibody technology: Lobuglk) etal. Proc. Nati. Acad. Sci. USA86: 4220-4224 (1989) : 
United States Patent 4.816,567 ; PCT International Publicatbn No. WO 87/02671, published May 7, 1987 ; Eu- 
ropean Patent Publication No. 255,694, published February 10, 1988 ; European Patent Publication No. 
274.394. published July 13. 1988 ; European Patent Publication No. 323.806. published July 12, 1989 ; PCT 

25 International Publication No. WO/89/00999, published February 9, 1989; European Patent Publication No. 
328,404, published August 16, 1 989; European Patent Publication No. 322, 424 published September 1 3, 1989, 
and European Patent Publication No. 438.310 published July 24. 1991. 

The immunogenicity of chimeric antibodies can be further reduced by grafting rodent hypervariable re- 
gions into the variable region frameworks of human light and heavy chains. Jones etal. Nature 321 : 522-525 

30 (1 986). These hypervariable regtons have also been termed complenientarity determining regions (CDR). The 
technique involves the substitution or recombinant grafting of antigen-specific murine CDR sequences for 
those existent within 'generic' human heavy and light chain variable regions. European Patent Publication No. 
239.400, published September 30, 1987. In this approach. I ittie, if any, concern is shown for the variable region 
frameworks (FRs) within which the murine CDR's are placed. 

35 Studies by Queen etal, Proc. Nati. Acad. Sci. USA 86 : 10029-10022 (1989), have shown the CDRs from 

a murine anti-Tac monoclonal antit>ody can be grafted into a human fran^ework. The human framework variable 
regk>ns were chosen to maximize identity with the murine sequence. The authors also utilized a computer mod- 
el of the mMab to identify several amino adds whrch, while outskie the CDRs, are dose enough to interact 
with the CDRs or antigen . These residues were mutated to the residue found in the murine sequence. The 

40 grafted anti-Tac antibody had an affinity for the antigen which was only atxHJt 1/3 that of the murine anti-Tac 
rhAb. 

Leukocyte infiltration into an inflammatory site is dependent on the adhesk>n of the leukocytes to the en- 
dothelium prior to extravasation. The rapid binding of potymorphonudear leukocytes (PMN) to the endothelium 
and diapedesis occurs within minutes after the introduction of a chemotactic stinujlus in tissue. Cybulski etal .. 

45 Am. J. Pathol. 124 : 367 (1 986). This rapid extravasation appears to depend on the response of the PMNs to 
chemoattractants and on the presence of the CD11/CD18 famSy of glycoproteins on the leukocyte surface. 
The family of glycoproteins assodated with PMNs are termed leukocyte integrins and indude LFA-1 
(CD11a/CD18). Mac-1 (CD11b/CD18) and p150,95 (CD11c/CD18). Each of these heterodimers has a unique 
alpha chain (CD1 1 a, b. c) and an invariant beta-2 chain (CD1 8). Stimulation of PMNs with various chenDOtactic 

50 factors causes increased expressnn of leukocyte integrins (CD11b/CD18) fostering strong adhesion to unsti- 
mulated endothelium i n vitro; Harian. Blood 65 :513 (1985). Essentially all of the chenKiattractant-induced ad- 
hesk3n is inhibited bytreatingthePMNswithmMAbsspedfk:aIly reactive with the CD11 /CD 18 complex. Harian 
etal.. Blood es : 167 (1985); Zimmerman and Mdntyre J. Qin. Invest 81 : 531 (1988); Smith et al. . J. Clin . 
Invest 82 : 1746 (1988) and Lo etal.. J. Exp. Med. 169 : 1779 (1989). 

55 Murin hybridomas produdng nK)nodonal antibodies reactive with the beta chain comnrK)n to the Mac-1 . 
LFA-1 and th p150.95 integrins have been described. Th mMAbs are d signated 1B4. 60.3. TS1/18. H52 
and ATCC TIB 218. The 1B4 is an lgG1 antibody and was prepared by Wright tal. Proc Nati. Acad. Sd. USA 
80 : 5699-5703 (1983), the 60.3 is also lgG2a and was prepared by Beatty tal. J. Immund. 131 :2913-2918 
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(1983), TS1/18 is an IgGI antibody and was prepared by Sanchez-Madrid tal. . J. Exp. M d. 1^ : 1785-1803 
(1983), and ATCC TIB 218, a lgG2a kappa prepared by Springer etal , J. Exp. Med. 158 : 586-602 (1983). 
These antibodies appear to be functi nally qurvalent and cross-react with th beta 2 - chain found nhunrtan, 
sheep, pig, rabbit and dog leukocytes but n t with the beta-2 chain found n murine and rat I ukocytes. 

5 

Sumnnary of the Invention 

The present invention is directed to a method for producing humanized monoclonal antibodies by utilizing 
a process of comparative nrKxiel buOding. In this method computer data bases are searched to locate homo- 

10 logous human protein sequences that correspond to specified regions of the non-human derived (usuaQy nmjr- 
ine) antibody, and a series of nrK>dels is formulated, tested and modified to produce a nrxxJel of a humanized 
antibody which ts then constructed by recombinant DNA technology. In a preferred embodiment, a humanized 
nrwnodonal antibody corresponding to the murine anti-CD18 antibody 60.3 was prepared. 

The variable (V) region sequences from both the heavy (H) and light (L) chains were determined from cDNA 

15 (amplified by PCR), and splteed onto human constant (C) regions, resulting in a chimeric 60.3 Ab (Ig61 , kappa). 
The chinDeric Ab was expressed in tissue culture (Ag8.653 mouse myeloma cells, detected by EUSA), and ex- 
amined in binding assays. The results from competition and inhibition assays showed that the chimeric Ab was 
as effective as the murine 60.3 mAb. 

The deduced murine Vh and V|. protein sequences were compared to the protein sequence data base, and 

20 two hunrmn Ig protein sequences were selected to be used as templates. The present inventors modeled a mur- 
ine 60.3 Fv according to the deduced Vh and Vl protein sequences. Based on the 60.3 Fv nrKxiel and the two 
human template sequences selected from the protein data base, a humanized Fv was nrK>deled. 

Construction of the humanized 60.3 was done by piecing 5 pairs of complementary oligonudeotkJes to- 
gether (spanning the entire V region) to form the VH and VL TTiese were then attached onto vectors containing 

25 genes for appropriate C regk)ns to form hunnanized Ab (IgGI , kappa). TTie humanized proteins were again ex- 
pressed in Ag8.653 cells and binding assays were done. FACS analyses indicated that the humanized Ab rec- 
ognized cells expressing GDI 8. About a dozen of the humanized 60.3 Ab master wells were transferred and 
assayed for Ig. 

30 Brief Description of the Drawings 

In the drawings: 

Figure 1 Qlustrates an amino ackj comparison of the murine 60.3 antibody heavy chain (m60.3) with the 
human variable heavy chain consensus sequence for the framework regions of human subgroup VhI 
35 (hVh1/Jh4), the human template (M030) used for humanization (h60.3 template), a germline sequence hom- 
ologous to M030 (21 -2 'CL), and phases I to IV of the hun^nization process. All amino ackis whrch are Identk^al 
to the phase IV sequence are shaded. In additk>n, sequences which are different than the previous phase se- 
quence are shown in bold. 

Figure 2 Olustrates an amino add residue comparison of the murine 60.3 antibody light chain (m60.3) with 
40 the human variable light chain consensus sequence for the framework regk)ns of human subgroup 
Vklll(hVklll/Jk), phases l-IV of the humanizatton process and the human template used for the humanization 
(h60.3 template). All amino ackis whk:h are kJentical to the phase IV sequence are shaded. In additk>n, se- 
quences which are different than the previous phase sequence are shown in bdd. 

FIGURE 3 illustrates five pairs of complementary oligonudeotkies corresponding to the variable regkms 
45 of the light chain. 

FIGURE 4 Olustrates five pairs of complementary digonudeotides corresponding to the variable region 
of the heavy chain. 

FIGURE 5 ilkistrates the binding of murine, chimeric and humanized 60.3 antibody to HL60 human mye- 
lonrranocytic cells. Ruorescein isothk>cyanate (FITC) labelled antibody was incubated with cells at the concen- 
50 trations indkrtated on the abscissa and the anK)unt of antibody bound is indicated by rdative fluorescence in- 
tensity on the ordinate. 

FIGURE 6 illustrates the competitton by preincubatk>n of cells v^h chimeric and humanized 60.3 of the 
binding of FITC-labelled murine 60.3 to HL60 cells. HL60 cells were preincubated with 1 ug/ml of either chimeric 
60.3 (drdes) or hunr^nized 60.3 (squares), followed by incubation with various concentratnns of FITC-labelled 
55 murine 60.3. In the absence of competing antibody, FITC - m60.3 binding to the HL60 cells increased with 
increasing concentration (x). 

FIGURE 7 illustrates the direct competiti n fFITC-murin 60.3 binding to HL60 cells by chimeric and hu- 
manized 60.3. The dashed line shows th fluorescent intensity of binding by FITC-murine 60.3 in the absence 
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of com^titor, while additions f increasing concentrations of chimeric 60.3 (squares) and humanized 60.3 (cir- 
cles) inhibited FITC-m60.3 binding. 

FIGURE 8 ilustratesth results of a chemilumin scence binding assay fnnjrin (closed square), chimeric 
( pen square) and humanized (closed dianwnd) 60.3 antlKxfy upon HL60 cdls. Ih anticancer antibody Lj6 
5 (open diamond), which does n t bind to HL60 cell , was used as a control. 

FIGURE 9 Olustrates a series of restrictbn maps for plasmids utilized In the productk)n of the variable light 
chain plasmid pGK.11. 

FIGURE 10 illustrates the nucleotide sequence for the humanized variable light chain. 

FIGURE 11 ntustrates a series of restriction maps for plasmids utflized in the production of the variable 
10 heavy chain plasmid pNyl .16. 

FIGURE 12 illustrates the nucleotide sequence for the humanized variable heavy chain. 

Description of PrefiBrred Emtxxjiments 

15 The present invention is directed to a method of producing humanized nrK>noclonal antibodies (mAbs) by 
utiizing a process of comparative model building and rational design. In a preferred embodiment this method 
is utaized to produce a humanized nrK>lecule of the anti-CD18 murine monoclonal antitKxty 60.3. The nKMJse 
mAb 60.3 (lgG2a), which recognizes a functional epitope on the beta subunit (CD18) of leukocyte integrins, 
prevents the adherence and aggregation of polymorphonuclear neutrophils (PMN), resulting in a blockage of 

20 neutrophO mediated damage during shock and reperfusion injury. It would be advantageous to modify mAb 
60.3 to reduce the potential for HAMA (human antnmouse Abs) response. Therefore, in one emt)Odiment the 
present invention was directed to "humanize" the 60.3 mAb by creating an Ab whose constant (C) region is 
human, but whose variable (V) regbn is composed of both human (principally framework sequences) and 
mouse (principally GDR loops) sequences. For the studies described in the present inventksn, murine, chimeric 

25 and humanized antibodies were purified from solution by protein A chromatography on IPA - 400 Fast Row 
Imnmbilized rProteinA(Repligen, Cambridge, MA) using the manufacturer's recommended protDcol. 

In the present inventton recombinant methods are utilized to produce hunranized nrK>noclonal antibodies 
that contain complementarity determining regk)ns (CDRs) analogous to the originally derived monoclonal an- 
tibody, and which have homologous human heavy and light chain framework regions. The resulting antibodies 

30 demonstrate the binding affinity and specificity of the original antibody yet are completely humanized mono- 
clonal antibodies. 

As used herein the term "humanized" and its various grammatical forms as it relates to antibodies is de- 
fined to mean that the amino add residues of the antibody in the heavy and light chains are replaced with 
amino add residues corresponding to homologous human protein regions without altering the binding activity 

35 of the antibody. For the humanized 60.3 nranodonal antibody of the present invention there is approximately 
80% sequence identity of the variable regbns of the heavy and light chains with those of the human mAb, 
while the constant regk>ns are distinctly human. Some variation of indivklual amino adds in the antigen binding 
and framework regbns are contemplated by this invention and are within the scope of this inventton when such 
variatk)ns do not interfere or inhibit the binding to antigen, such as the lie for Glu substitutk)n at position 1(^ 

40 of the light chain. 

As used herein the term "canonical loop conformation" refers to a small repertoire of main chain confor- 
mations for f ive of the sbc loops (all except H3). The particular conformatk>n adopted is determined by only a 
limited number of residues within the loop or the framework. 

As used herein the term "frantework residues" means resklues whk^h are located outside the structurally 
45 defined GDR loops. These residues can be part of the hypervariatrie regions for the antibody. 

As used herein the term "monodonal antibody" refers to all recombinant antibodies derived from an initial 
single cell and indudes murine monodonal antibodies, chimeric antibodies and humanized antibodies. 

In the present invention a procedure of comparative nrK>del buflding was utilized to construct the appro- 
priately designed humanized antibody. As a preferred embodiment, the modeling of the murine 60.3 antibody 
50 is summarized. 

The Brookhaven Protein Database (Bernstein et al. (1977) J. Md. Bid. 122 : 535-542) was searched for 
the antibody crystal structures which show reasonably high honrK)logy (> 50% sequence identity) to the variable 
regbns of murine 60.3. If the variable l^ht chain and heavy chain templates whteh fiilf HI these criteria are from 
different antibodies, these structures are combined by superposition of the set of structural invariants at the 
55 Vl- Vh domain interface (N votny et al. (1985) Proc. Nati. Acad. Sd. USA 82 : 4592-4596). This provides the 
"structural template* for model building of murin 60.3 (and humaniz d 60.3 below). 

Th GDR loops and th ir known structural firam workd term Inants of murin 60.3 are d termined by de- 
fining the GDR loops structurally according to th ntethod of Chothia et al. (Ghothia et al. (1989) Nature 342 : 
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877-883). Th structurally d fined CDR loops consist on average of shorter sequence segm ntsthanth hy- 
pervariabl regions defined by Kabat (Kabat t al (1 987) Sequences of Proteins of Innmun logical interest). 
Th f iv canonical CDR loops (L1-13, H1-H2, i.e. all except H3) in the 60.3 variabl light and heavy chain are 
assigned to known can nical loop confornDations, and the franiework residues which are crucial for the con- 

5 fbrmati ns fth CDR loops are d termined. 

The norv-canonical H3 loop region within the 60.3 sequence is defined and a model of murine 60.3 ts then 
built The CDR loops of the structural template are replaced with canonical CDR backt>one templates as de- 
termined using interactive computer graphbs (INSIGHT II. Ver 2.0 Biosym Technologies, Inc. 1991). Loop 
searches (Jones TA(1986) Em bo J. : 819-822) in the Brookhaven Protein Data Bank are carried out to extract 

10 an initial backtx>ne approximatton for the non-canonical CDR loop H3. 

All non-conserved amino add side chains in similar positions are replaced using interactive computer 
graphics. The model then consists of a combination of backbone fragments of different antibodies with replaced 
side chains. The nfKxiel is sohrated with a 7 A water layer and the structure is refined using an energy minimi- 
zation (Mackay et al. (1989) PrBdictk>n of Protein Structure and the Principles of Protein Conformatton. New 

15 York: Plenum Press pp.31 7-358) protocol where, over the course of 1600 cydes of conjugate gradients mini- 
mtzation, constraints of 80 kcaI/nrN>l/A2 on all protein non-hydrogen aton^ are gradually released until, at the 
final stage of the minimizatnn procedure, all atoms of the system are free to nrave. 

The nrK>st honrnbgous hunrtan variable region sequences are found by searching the sequence data base 
for the most homologous human sequences for the variable light and the variable heavy chains of the 60.3 

20 antibody and these sequences are combined to obtain a "human template". The structural template for murine 
60.3 is confirmed to be suitable for the human template. The sequences of the structural template chosen ini- 
tially showed > 50% sequence identity to the variable regions of the "human template". Furthermore the per- 
cent homology is chosen to t>e similar to that found for comparison of the structural template with the murine 
sequence. The CDR loops and known structural determinants are then grafted onto the human template 

25 (Jones et al. (1 986) Nature 321 : 522- 535). The CDR loop regions and structural determinants in the "human 
template" sequence are replaced by the analogous sequences from the murine antibody, as determined above. 
This provides the Phase I h60.3 sequence. A Phase I model of hunnanized ^.3 ts t>uilt using the same model 
building protocol as described for murine 60.3. 

In Phase II the nmjrine and Phase I h60.3 models were compared. These nKxJels now consist of the nrujrine 

30 binding site and murine framework (nmjrine 60.3 or m60.3) and of murine binding site and human frannework 
(Phase I humanized 60.3 or h60.3). The of murine and Phase I h60.3 were superimposed using the structural 
invariants of the inununogtobulin fdd (Novotny et al. (1985) Natl. Acad. Sd. USA 82 : 4592-4596). The models 
of the binding site regions were compared reskJue by residue from the N-terminus to the C-terminus. By this 
comparison, all framework residues and residueswithin the framework - CDR junctk)ns which can interact with 

35 the murine CDR loops and may therefore be important for the structural integrity of the murine binding site 
were kjentified. These resklues typk^ly indude all the known structural determinants for the specified can- 
onical CDR loop conformatbns (Chothia et al. (1989) Nature 342 : 877-883) and other residues found to be 
critical in the comparison (due to proximity to the CDR loops and potential for interaction with them). These 
residues were then "re-mutated" to the murine reskjues, forming the Phase II h60.3 model. 

40 The murine 60.3 nrKxlel and the nuxiified hunnanized sequence were then further refined by again sut>- 
jecting the models to the energy minimization procedure described above. This construct represents the Phase 
II h60.3 model. 

In Phase 111, further improvements of the sb-uctural nrK>del of h60.3 were made A conformational search 
(Bruccderi RE and Karplus M. (1987) Bk>polynners 26 : 137- 168) was carried out over regtons of the binding 

45 site which cannot be directly assigned to known structural templates. Typically, thb is the CDR k>op H3 and 
perhaps one or more CDR loops which may not belong to known canonical structure types. SkJe chain con- 
formations of the antigen binding site loops and the framewcH'k - CDR junctions are also further refined using 
an iterative confer mattonal search protocol (Bruccderi RE and Karplus M. (1987) Biopotymers 26 : 137-168). 
The refined model structure may be called Phase III h60.3. 

50 In Phase IV, analysis of the binding site features of the Phase III h60.3 nrxxlel was carried out The binding 
site features of the construct were armlyzed in detail in order to dassify the antibody structure, for example, 
as a "groove-type" or "cavity-type" or "flar antit>ody. This allows one, in the absence of detailed structural 
knowledge of the antibody-antigen complex, to postulate which parts of the CDR surface or residues at the 
CDR-framework junctions are unlikely to be involved in antigen binding. In the Phase III and earlier nrKxids, 

55 these positbns nrtay be occupied by murin resldu s which can now be changed to human reskJues. 

Thisinnprovesth "degreeofhumanizatk>n"oftheantibodysince parts of som CDR loops and th r ntirs 
CDR loops can be "humanized". At this ^g . th final v rsi n of humanized 60.3, Phas IV h60.3 was ob- 
tained. 
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Comparativ molecular modeling has been used here to enable a d tailed three-dim nsional comparteon 
ofamurin antibody and its humanized version. This comparativ study has nabledth present inventors to 
analyze residue-residue interactions which are lik ly to be critical to retain the nrrurin pecif idty in the struc- 
tural context of a largely human antibody. Furthermore, th different nrK>deling concepts based n structural 
5 homology ( xp rim ntal structural data) and conformational search (which represents an abinitio m thod) 
have been combined to obtain the best possible picture of the 60.3 binding site in order to, gain some insight 
Into which of the binding site residues may be not involved in antigen binding. 

In addition to the application summarized above, comparative model building can be applied to other prob- 
len^. For example, many of the antibody structures which are modeled today are used to guide mutagenesis 
10 experiments in order to explore affinity and antigen specifk:ity. Such antibodies are often nrK>deled t>ecause 
experimental structures are not available for these antibodies. Comparative model building provides an op- 
portunity to assess the confidence level of such theoretically derived structures. 

For example, the combining site of a clinically relevant antibody can be derived starting firom different struc- 
tural templates and employing the diffarent methods based on structural homology and conformational search. 
IS By pairwise combination of two different templates with two different methods, four nriodel structures can be 
derived in an independent way and then compared by superposition of structural invariants. This comparison 
allows for the determination of how well the independentty derived structures agree and which parts of the 
models do not show satisfying agreement In the absence of experimental structural data, such comparative 
model tHiilding exercises presentiy provide the only way to assess the confidence level of antibody models. If 
20 the independently derived structures agree well, a high confidence level can t>e assigned to the rnodeH and a 
'consensus nradel' can be prepared. The consensus model would then typically represent a combination of 
structural elements derived by structural honralogy and conformational search. On the other hand, dteagree- 
ment of the models allows for the identification of the particular critical regions in model structures which are 
less well defined and need to be improved or, if this is not possible, treated with caution. Such knowledge, 
25 obtained by comparative model building, is very Important for the use of model structure for experimental de- 
sign. 

Although the humanized 60.3 Ab was prepared by grafting the murine CDRs onto the hun^n frameworks, 
certain amino ackis were not changed firom the murine protein sequence to their human counterpart (due to 
their importance in retaining the conformation of the CDR loops). Therefore, three humanized 60.3 L "mutants" 
30 (based on computer modeling) were constructed in an effort to 1) further reduce its divergence and 2) deter- 
mine the contribution of these amino adds on antigen (Ag) binding. 

There are 4 amino adds in the Lchain which are changed: one is in the CDR2 (postulated not to be invoked 
in binding) and the remaining amino adds all reside in the framework 2 region. The three humanized 60.3 L 
"mutants' are as fdlows: 
35 1) Mutant 1: amino acki change is only at position 50, from Arg (R) to Asp (D). 

2) Mutant 2 : 3 amino adds changes are carried out at positions 50 (as in Mutant 1). 54 [firom Leu (L) to 
Arg (R)] and 55 [from Glu (E) to Ala (A)]. 

3) Mutant 3: contains the changes in Mutant 2, and at position 68 Arg (R) is changed to Gly (G). 

From this set of mutants, effects can be seen on binding by (1) Arg 68 alone (by comparing results firom 
40 Mutants 3 and 2) and (2) Leu 54 and Glu 55 (by comparing results firom Mutants 2 and 1). 

Total cellular RNA was extracted from a 60.3 produdng cell-line according to the method of Davis et al 
(1966). The first strand cDNA was synthesized using a cDHA synthesis kit firom InWrogen and an oligo (dT) 
primer. The cDNAwas then amplified by polyermase chain reaction (PGR) using degenerate primers (Larrick, 
et al.(1991) Scand. J.. Immun. 32 :121-128: Colloma and Larrick (1991) Bk>technk|ues 11 : 152-156). For the 
45 heavy chain, the sense primers (MH-SP-ALT.1 and MH-SP-SLT.2) were from the signal peptide and had an 
Xho I restriction site and the 5* end. The antisense primer (MH- gamma-CONST) was a consensus sequence 
frcMTi the CHI domain of murine y heavy chains and had a Pst I site at the 3' end. 

For the PGR amplification of the light chain, the sense primers (EcoRI/FRI-ML (k)) were firom the 5' end 
of the first framework reg'on and had an Xho I restriction site at the 5' end. The antisense primer (Hindlll/M1(k- 
50 )CONST) was firom the k constant regbn. but had a Sal I restriction site instead of the Hind III site described 
by Larrick etal. 

For both the heavy and light chains, the PGR products were either restricted (Xho I and Pst I for the heavy 
chain; Sal I and Xho I for the light chain) and doned into similarly r^bicted pUC 18 or treated with nudeotide 
kinase followed by blunt ended ligation into Sma I digested pUC 18. Ligation products were used to transform 
55 competent DHa E. cdi cells. 

CI nes containing inserts were selected using X-Gal/PTG; positive dones were screened for appropriately 
sized EooR l-Sal I restricti n fragm nts. EcoR I and Sal I flank the doning sites in pUG 18 and are therefore 
expected to release the PGR product producing an approximately 0.5kh firagntent Sel cted-d nes were sub- 
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jected to the double stranded DNA sequendng (Hsiao (1991) Nud. Acid Res 19 :2787) using Sequenase 
(U.S.Biochemical). Th sequ nee is sh wn in Figures 10 and 12, and Sequ nee I.D. numbers 9 and 10. Th 
Vg n sequ nces were compared to sequences of other murin Ig genes (Kabat et al, (1987) Sequ ncesof 
Proteins of Immunological Interest, 4th ed.. Nat Inst of Health, Bethesda. MD.) The heavy chain was found 

5 tobel ngtoth VH I la subgroup and the light chain to bel ng to th Vk II lb subgroup. 

In order to be sure that the correct V genes had been doned and sequenced, heavy and light chain from 
purif »d 60.3 were subjected to IM-terminal amino add sequendng. The amino add sequence of the heavy chain 
was identical to that deduced from the DNA sequence. However, there was a discrepancy for the 7th and 8th 
amino adds of the L chain. For the DNA, these amino adds (Tyr and Gin, respectively) are encoded by the 

10 sense primer used for the PCR reaction. Ser and Pro were found at these positions by amino add sequencing. 
Furthermore, almost all other Vk genes of this sut>group had Ser and Pro at this position. It was conduded 
that the primers used were not entirely appropriate for the V gene. They were however, similar enough to the 
cDNA that annealing and priming could occur. The codons for these 4 amino acids occur at the 3' ertd of the 
primer and are as follows: 

15 Tyr Gin : TA(CVT) CA(A/G) 

ScrPro : TCX CCX 

To determine the real sequence at this position, the entire procedure (cDNA synthesis, PCR ampfrfying, 
doning. and sequendng) was repeated using primers which terminated before the codons in question. This 
20 showed that Ser and Pro were encoded at positions 7 and 8, respectively. The initial PCR product was reP- 
CRed, using a sense printer which encoded Ser and Pro rather than Tyr and Gin at positions 7 and 8. 

For each V gene, 2 PCR primers were synthesized. The amplification of genes by PCR. doning into Pud 8, 
as well as the double stranded DNA sequendng were all done as described above. The PCRed V genes were 
then doned into the expressk>n vectors PNy 1.16 and PGkll which have human constant regions (Fig. 9 and 
25 11) before transfectton into the mouse myeloma cell line, Ag8.653. 

The VL and VH genes were Inserted into pGk.11 and pNyl.ie. respectively, by amplifying the genes by 
PCR adding restrictton site and intron sequences at the 5' and 3' ends. 

The sense primers for both the H and L chains contained within their sequences the following in the 5' - 
3' direction: 
30 1. N6, 

2. restrictk>n srte for doning (EcoRI and Sad for the H chain; EcoRI and Hindlll for the L chain; EcoRI is 
for doning into pUC18, Sad and Hindm are for doning into the expression vectors), 

3. branch point lariat signal. 

4. polypyrimidine tract, 
35 5. splice acceptor site. 

6. leader peptide 2, 

7. the beginning of the FR1 of the V gene. 

The reverse complement of the antisense primer contained: 
1 . 3' end of VDJ gene for the H chain and of the VJ gene for the L chain. 
40 2. splice donor signal. 

3. restrictk>n site for doning (Sail and Xhol for both H and L chains; Sail Is for doning into pUC 18, Xhol 
is for doning into the expression vectors), 

4. N6. 

The sense primer for the heavy chain was: 

45 

S'aeGAATTCGAGCICITITTCTGATAAaj^^ 
GTCCAGTGTCAGGTCCAACTKLVGCAGCCnXj^ 



50 The anti-sense primer for the heavy chain was: 

s'aggtcgacckxsagitnxsaggactcacctgaggagacggtg^ 
gcx:t3' 

The sense primer for the light chain was: 

55 

5 •agGAATTCAAGinTrcCKj ACTACATGAGTGCATTTCT^ 
ATTrcAGATACCACX:GGAGACATKnXKn'AACACAATm 
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The anti-sens primer for the light chain was: 

S'A^GTCGACCTCGAGATCACITACGTITGATTrcCAGClTGGl^ 



The amplrf ication of genes by PGR, doning into pUC18 and the double stranded DNA sequencing were 
done as described above. The PC Red V genes were the cloned into the expression vectors. For the heavy 
chain, the V gene was directionally doned into the Sac I and Xho sites of pNyl .16. For the light chain, the V 
10 gene was directionally doned into the Hind III and Xho I sites of pGk.11 . 

Humanized VH and VL genes were constructed by dlgos and ts described In detail herein below. The in- 
sertion of humanized Vh and Vl genes into pMr1.16 and pGk.11, respectively were done by the procedures 
described for the chimeric Ab. The sequences needed for doning and expression were built in (or induded in ) 
in oligo#1 and #10. 

15 The presence of the chimeric 60.3 Abs were detected by ELISA In this assay, 96 well plates were coated 
with goat anti-hunuin IgG. The chimeric 60.3 in the sample which bound to the plates were detected by using 
horse radish peroxidase (HRPO) conjugated human anti-kappa IgG. Purified chinneric L6 (an unrelated anti- 
tumor antibody) was used as a standard. The culture supernatants from Ab produdng dones were selected 
for binding in FACS binding assays. 

20 FACS Binding Assays 

HL60 is a human myelomonocytic cell line which expresses CD 1 8 on the cell surface. 
HL60 target cells, grown in culture media (DM EM, 1 % fetal calf serum and 2% L-Glutamine). were made Wtm\ 
in the culture containing 0.1 % NaAzide at 4''C. 1 0^ cells were used per binding assay. Bovine IgG (Sigma, 20 
ug/ml final concentratton) was added for 1 0 min. at 4*'C to mask the Fc receptors before performing the assays. 

25 Binding assays were done as direct binding, pre-incubation or competition assays. 

1) Standard binding assays 

Various anK>unts of murine, chimeric or humanized 60.3 were incubated for 45 minutes with HL60 cells, 
washed and then incubated with FITC-labeled goat anti-nnouse (for murine 60.3) antibody (Tago, Inc.) at 
1:50 dDution or FfTC-labeled goat anti-human antibody (for chimeric and humanized 60.3.) The fluores- 
30 cence intensity was then monitored. 

2) Pre-incubation assays 

Pre-incubation experiments were done with murine 60.3. For pre-incubation with either chimeric 60.3, or 
humanized 60.3, 1000 ngAnI of either c60.3 or h60.3 was added to the HL60 cells for 45 minutes to 1 hr 
at 4"^. Cells were then washed twice with 1 ml of k:e cold NaAzkJe-containing culture media and centrrf- 
35 uged in a Bedonan table top centrifuge (Model TJ-6) at 1000 rpm for 5 min. After decanting washes, cells 
were resuspended in various concentrations of FITC-conjugated murine 60.3 Abs and incubated on ice for 
45 min. to 1 hr. with intermittent mixing at lOmin intervals. Cell pellets were fbced in 300 ul of 1 % paraf- 
ormaldehyde at 4''C before analysis of FACS machines. 

3) Co-incubation assays 

40 Co-incubatk>n assays were carried out by incubating various concentrations of chimeric 60.3 or humanized 
60.3 with a saturating concentration (1 ug) of FITC-conjugated murine 60.3 Abs. Both types of At>s were 
added to the HL60 cells simultaneou^y; thus eliminating one incubatk)n step. Incubation, washing and cerv 
trifiigatton were all done as in the pre-incubation assays. The cells were also suspended in 1 % k:e cold 
paraformaldehyde before analysis by FACS. 
45 The mean channel values obtained from FACS analyses were translated into linear fluorescence equiva- 
lence (LFE)va!ues. The LFE values were further used to calculate the fluorescence intensity levels (FIL) ac- 
cording to the following equation: 

FIL = LFE of sample/LFE of negative control (no second antibody) 
Electroporation of cells was carried out on a BioRad electroporator, set on capacitance of about 960 fu 
50 and .25 volts. A count of viable cells was taken before starting. Cells were at least 90% viable for use. Cells 
were also in the 4-6 x 10^ /ml range; if they are overgrown, they wfll not show high transfectk>n efficiency. 1 
X 107 cells/electroporation group were renxwed and one group was used for a control electroporation and one 
for an "unzapped" control. Cells were centrifiiged at 1000 rpm for 10 minutes. Supernatant was removed by 
vacuum with a sterile, unplugged pasteur pipet and the pellet was resuspended in as large a vdume of PBS 
55 as th tube will allow and washed again. 

Th pell t was resusp nded in 0.8 ml PBS per 1 x 10^ cells and 0.8 ml aliquot were added into labeled 
CUV ttes. 10 ug each of DNA was added to cuvette. Incubate cuvette on ice for 10 minutes aft^ mixing. Th 
CUV ttes were electroporated n ting tinte factor reading and put on ice for 10 minutes, and then transferred 
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into 19 mis of IMDM/10% FBS. using 1 ml of medium to wash them out of the cuvettes. Cells were at 37° C 
for 48 hours, and th n plated at 10^, 3 x lO^. and 1 x iO^ cells per ml In IMDM with 10% FBS and fed for 2-3 
weeks before sere ning. 

EnzyPDe-linked inrmiunos rptton assays (EUSA) were carried out as is known in the art In an illustFative 
5 embodiment fEUSAInth present in venti n wasp rformed as follows. Plates were created by d fluting g at 
anti-hunruin IgG 1:10,000 with 0.05M carbonate buffer, pH9.6 and transferring 100 ul to each well of a 96 well 
microtiter plate . The plate was then incubated at 4*^0 for 12 to 16 hours. The plates were then rinsed 1 to 3x 
and specimen diluent was added. The plates were then incubated at room temperature for 1 hr and rinsed 3x. 
On separate plates, 30 ul of cultural supernatant was diluted to 300 ul with the specimen diluent and 50 ul was 
10 transferred to the previously coated plates. 50 ul of specimen diluent was added and maintained at room temp 
for 1 to 2 hours and rinsed 3x. HRPO conjugated human anti kappa was diluted with conjugate diluent 1:5,000 
and 100 ul was added per well to plates and incubated at 37*'C for 30 minutes and washed 3x. A chronr^agen 
(1:300) with huffier substrate, pH 5.5 (room temperature) and lOOul was added to the plates. The plates were 
incubated at room temperature for 15 minutes, 100 ul of 3M H2SO4 was added and the plates were read at a 
15 wavelength of 450 nm and 630 nm. 

Chimeric and humanized Mab 60.3 ware analyzed using size exclusion HPLC (secHPLC), sodium dode- 
cylsulfate-polyacrylamkle gel electrophoresis (SDS-PAGE), and isoelectric focusing (lEF). Test samples were 
compared to a murine 60.3. 

20 secHPLC 

ATSK3000SW Spherogel 7.5 x 600 mm column nruinufactured by Toso Haas was used. The nmbile phase 
was 0.05 M phosphate buffered saline and samples were eluted at 0.5 ml/minute for 60 minutes. The chroma- 
togranvs show that the test lots have multiple contaminating peaks and a major peak eluting at about 27.5 min- 
25 utes. The major peak observed with the reference murine 60.3 Mac chromatogram, eluted at about 26.7 min- 
utes. 

SDS-PAGE 

30 SDS-PAGE was perforn^ed using a 4-20% gradient gel and bands were detected by Coonr^ssie blue stain- 
ing. All lots were compared to the murine reference lot. Samples were run both reduced with 2-mercaptoetha- 
nol and non-reduced. Non-reduced gels showed that the major band ran consistent with the reference stan- 
dard. The reduced gel showed data consistent with heavy and light chain separatnn. 

35 lEF 

Isoelecfaic focusing was performed using precast gels with a pH range of 3-10. Both 2 ug and 5ug of sample 
were applied and run at 100 volts for 60 minutes, 200 volts for 60 minutes, and 500 volts for 30 minutes. Gels 
were stained with Coomassie Blue R-250. The chimeric 60.3 sample had no visit>le bands. This suggests that 
40 the material dkj not migrate into the gel. The humanized 60.3 sample had visible bands at the top of the gel 
in a regton difficult to determine pi's. This data suggest that the chimeric and humanized lots of antibody have 
pi's greater than 8.0 as determined by the reference nriarkers. This is in contrast to the murine mAb 60.3 which 
has a pi range of 6.8 - 7.5. 

Having described this invention and embodiments thereof, the present invention is further Slustrated by 
45 the fdbwing Examples whbh are not intended to limit the scope of the invention. 

EXAMPLE 1: Comparative Model Building 

The structural templates for comparative model buOding of murine 60.3 were determined. The Brookhaven 
50 Protein Data Bank was searched for t he sequences wit h known structure which are nrK)st homologous to nrHJrine 
60.3 heavy and light chains. The antiphosphochdine murine myeloma antibody, McPC603 (Satow et al. (1 986) 
J. Md. Bid. 190 : 593-604) was found to be the most homologous to the light chain, with 68% sequence iden- 
tity/homology. The anti-p-azophenylarsonate murine monodonal lgG2, R19.9 (Lascombe et al (1989) Proc. 
Natl. Acad. Sd. USA 86 : 607-611). was found to be the nrK>st homdogous to the heavy chain, with 59% se- 
55 quencekientity/honruxlogy. MCPC603VL and R19.9VH were combined in rder to obtain th structural tem- 
plate for nrKxjel building of murin and humaniz d 60.3. 

Th CDR k>ops and known structural determinants of murine 60.3 were d termined. Three CDR loops in 
60.3 can be directiy assigned to known can nical types (Chothia t al (1989) Nature 342 : 377-383). These 
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are L2 (type 1), L3 (type 1) and H1 (type 1). The remaining CDR ioops do not t>elong to known can nical struc- 
ture types. The boundaries of these loops can be d termined by aligning th 60.3 sequ nee with that of the 
structural model. These assignments are shown al ng with thos of L2, L3 and HI in Figs. 1 and 2. Th fra- 
mework residues which are crucial for the conformations of the CDR loops are also shown in Figs. 1 and 2 
5 { for xampi , « L1 indicates structural det rminants for the LI loop) and in Sequence I.D. Numbers 2, 4, 6, 
and 8. 

A model of murine 60.3 was then tHiilt Backbone loop templates for L2 and L3 were taken from McPC603, 
and HI from R19.9. Li was initially modeled by a two resklue deletion of the LI loop of McPC603. H2 was 
found to be similar to the corresponding loop region in R19.9 , which may represent a not yet classified can- 
10 onica) nK>tif. 

No loop closely related to H3 was found in the Brookhaven database. As a initial approxinDation for H3, an 
antibody non-CDR loop of the same size as 60.3 H3 (defined here as 96 to 102 in the Kabat numbering scheme 
or as H99- H 109 in the continuous sequence) was used. The backbone template for this loop was a 13 residue 
segnrmnt of the antibody NEW (Pdijak RE. et al. (1974) Proc. Natl. Acad. Sci. USA 71 : 3340), beginning at 
IS residue LB. The loop was selected because it has the sanne length and showed a reasonable fit into the ad- 
jacent framework of H3. Energy minimization/conformation refinement of murine 60.3 resulted in: 1) residual 
rms derivatives of the energy functton : 0.63 kcal/nK>l A 2) Backbone rms deviatk>ns from the initial crystal 
coordinates: Vi.: 0.86 A ; VhiI .17 A. 

The nrK>st homok>gous human variable regk)n sequences were found. The human sequence most homo- 
20 logous to 60.3 V|. is PIR Accession # A0 1 900 (sequence kientity/horrK)logy 66%). This is the Vg germline se- 
quence described by Pech and Zachau (Pech, M. and Zachau, H.G. (1 984) Nucleic Adds Res. 12 : 9229-9236). 
Vg bek>ngs to human Vk subgroup Ilia. 

The human sequence nrast homologous to 60.3 Vh is PIR Accessk>n # A32483 (honrwiogy 59%). This is 
the heavy chain from human monodonal Ab MO30 (anti-HIV gp 120) (Larrick et al. (1989) BBRC 1^ 1250- 
25 1256). TTiere are two germline sequences highly honrK>logous (1 aa mismatch through FR3) to MO30 : 21-2 
and 3-1 (Berman et al (1988) EMBO J. 7: 727-738). TTiese sequences belong to human VH subgroup 1. 

The structural template for murine 60.3 was confirmed to be suitable for the human template. The human 
template for the heavy chain (MO30) is 56% honrK>logous to R19.9. TTie human template for the light chain (Vg) 
is 62% honnologous to McPC603. These numbers are similar to the honiology between murine 60.3 and the 
30 same structural templates. The CDR loops and the structural determinants for the human template are shown 
in Figs. 1 and 2. The CDR loops and known structural determinants were then grafted onto the human template 
(Jones et al. (1986) Nature 321 522-535). The results are shown in Figures 1 and 2 in the column marked 
Phase I h60.3. In these figures, the h60.3 sequences and all klentical sequences from the other columns are 
shaded. 

35 A Phase 1 nfKxiel of humanized 60.3 was then built using the same nK3del tnjilding protocol as for nxjrine 
60.3. The backbone CDR loop templates were the same as for m60.3. Energy minimtzatk>n/conformational re- 
finement of murine 60.3 and Phase I h60.3 resulted in: 
residual rms derivatives of the energy function: 
m60.3 : 0.63 kcalAnolA 

40 Phase 1 h60.3 : 0.58 kcalMK)lA 

Backbone rnts deviattons from the initial crystal coordinates: 
m60.3 model Vl: 0.86 A; Vh: 1 .1 7 A 

Phase I h60.3 Vt: 0.96 A; Vh: 1 .1 5 A 

In Phase 11, the murine and Phase I h60.3 models were compared and refined. A comparison of the models 
45 of nturine and hunnanized 60.3 gave an rms deviatton of 0.66 A and 0.78 A for V|. and Vh, respectively. As a 
result of the comparison, it was postulated that certain residues which were still "human" were important for 
the conformatk>n of the CDR loops. These were therefore changed to the nmjrine residues, and are shown in 
bold in the column nruirked Phase ll/lll h60.3 in Figures 1 and 2. Sequence comparisons of m60.3 and Phase 
II 60.3 models (including the CDR loops) gave the following honx)k>gies: 
50 Phase II 60.3 Vl : 

vs.murine 60.3 Vt 82% 
vs. human template Vt, 82% 

Phase II h60.3VH 
vs. murine 60.3 Vh 78% 
55 vs. human template Vh 81 % 
Phase II h60.3 Fy 
vs. nrujrine 60.3 Fv 80% 
vs. human template Fy 81% 
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In Phase III further improvem nts of the structural model of h60.3 were mad .After further reflnem ntof 
th murin and humanized models, th following param ters were btained: 

Rnal rms derivatives of th energy function: 
murine 60.3 model: 0.55 kcal/mol A 
5 Phase III h60.3: 0.78 kcalMiol A 

Backbone rms deviatk)ns from the initial crystal coordinates: 
murine V|.: 0.82 A. Vh: 1 .09 A Fy: 0.98 A 

Phase III h60.3 Vl: 0.88 A, Vh: 1 .07 A Fy: 1 .00 A 

Finally, in Phase IV, the binding site features of the Phase III h60.3 nuxJel were analyzed. When comparing 
10 murine vs. humanized models at prevnus stages, the emphasis was ntore on the comparisons of the CDR - 
framework interactk>ns in the murine antibody and the humanized contacts. The Phase IV nrxxlel, where the 
Li and H3 loops were remodeled using conformational search, allows a more detaOed analysis of the CDR 
surface than the prevnus models. 

Analyss of the nrKxlel suggests that 60.3 is a distinct groove-type antibody and that certain COR loops 
15 (L2 and HI) may not be involved in antigen binding. In the Phase IV model, the amino acids in these loops 
have been changed to the sequences from the human template. 

EXAMPLE 2 

20 CONSTRUCTION OF THE HUMANIZED 60.3 Vh AND Vl GENES 

Construction of the humanized 60.3 V genes was done according to the modeling methodology described 
in Example 1 by piedng 5 pairs of complementary oligos together (see Figures 3 and 4 and Sequence I.D. 
Numbers 11 to 30) each oligo was about 90 to 100 nt In length, together they span the entire V regions, forming 
25 the Vh and V|_ The step by step protocol used was as follows. 

1. Five mioofiige tubes were labeled and the following oligos (10 ng each, 10 ul) were added: 

tube 1: oligo #2 
tube 2: oligo #3 and #4 
tube 3: oligo #5 and #6 
30 tube4:oligo#7and#8 
tube 5 : oligo #9 

2. Tubes 2, 3 and 4 were heated to lOO^'C for 3 min. and then cooled slowly to room temp. 

3. To all 5 tubes, iOx kinase buffer (40 m M Tris. CI pH7.5, 10 mM MgCl2. lOmMDTT, lug DNA. 0.5 mMATP, 
50 ug/nti bovine serum algumin and 1 "weiss" unit T4 DNAIigase), 100 uM ATP (1 ul) and nucleotide kinase 

35 were added to phosphorylate the 5' ends of the oligos. One Weiss unit is equivalent to 60 cohesive-end 
units. Reactions proceeded at 37*" for 1 hr. 

4. The tubes were extracted with phenol/CHCIs and precipitated with ethanol. 

5. Oligos #1 and #10 (lOng, 10 ul) were added to tubes 1 and 5, respectively. Tubes 1 and 5 were mixed 
and step 2 was repeated for tut>es 1 and 5. 

40 6. The contents from all 5 tubes were pooled into a single tube. 

7. Ugate at 12° C, for 12 to 16 hours in a vol. of 25 ul. 

8. Analyze 2 ul on 1.0 % agarose. 

9. After successful ligation, restriction digest an alkfuot with EcoRI and Sail for 45 min. at 37''C. 

10. Apply onto 3.0% low melt agarose gel and cut out the correct sized band (approximately 0.4kb). 
45 11. Ugate into Puc 18 (pre-digested with EcoRI and Sail). 

12. Transform DH5a cells. 

13. Select the potential positives based on Xgal/IPTG indicators. 

14 Miniprep cultures were prepared and maintained for 12 to 16 hours. 

1 5. Rasmid DNA was isolated from these cultures and the insert sizes were checked by cutting with EcoRI 
50 and Sail. 

1 6. The plasmid DNA was sequenced for verif k^ation. Several of the clones had mutations such as single 
base deletions. 

17. The synthetic H and L variable genes were cloned into appropriate expression vectors (pNG1.16 and 
pGk. 11, respectively.) 

55 1 8. Th potential positives were isolated after selection on ampictllin. 

19. Steps 14 to 16 were repeated. 

20. Transfection into mouse Ag8.653 myeloma cells was carried out, followed by selecti n with G418 (Raff, 
tel. . (1991) J. Infect Dis. 1^ : 346-354). 
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21 . Ig positives were then screened with EUSA (ganrvna, kappa capture). The DNA sequences of th nmjr- 
ine 60.3 heavy and light chains are shown in S qu nee I.D. numt>efs 5 and 7, respectively; and th DNA 
sequences for the humanized 60.3 l-l and L chains are shown in Sequence I.D. numbers 1 and 3, respec- 
tively. 

5 

EXAMPLES 

BINDING ASSAYS OF 60.3 

10 The binding activity of the humanized 60.3 antibody was n^easured by preincubation, competition and 
chemilluminescence assays. 

HL60 is a human nr«yelonx>nocytic cell line which expresses CD 18 on the cell surface. 
HL60 target cells, grown In culture media (DMEM, 1% fetal calf serum and 2% L-Glutamine), were made lO^/ml 
in the culture media containing 0.1 % NaAzide at A^'C. 1 0^ cells were used per binding assay. Bovine IgG (Sigma, 
15 20 ug/ml final concentration) was added for 1 0 minutes at 4''C to mask the Fc receptors before performing the 
assays. Three types of binding assays were performed. 

A) Standard curves 

20 As illustrated in Figure 5, various amounts of murine, chimeric, and humanized 60.3 were Incubated for 45 
minutes with HL60 cells. The cells were washed and then Incubated for 45 minutes with either -FITC conju- 
gated goat anti mouse IgG (for m60.3) or FITC conjugated goat anti human IgG (for c60.3 and h60.3). Excess 
antibody was washed off and the cells were fixed with 1 % paraformaldehyde and assayed by FACS. For each 
curve, the value obtained at 1500 ng/lml is taken as 100%. Data at other concentrations are plotted as % of 

25 this value. As can be seen in Figure 5, all three antit>odies titrate over approxipDately the same range., indicating 
similar affinities (known to be at>out 10^ for m60.3). 

B) Preincubation experiments 

30 HL60 cells were preincu bated for 45 minutes with 1 ^g/ml of either cGO.S or h60.3. The indk:ated amount 
of FITC conjugated mSO.S (or no antibody) was then added and the cells were incubated for another 45 min- 
utes. After washing, the cells were fbced with paraformaldehyde and assayed by FACS. As can be seen in Fig- 
ure 6, both the chimeric and humanized antibodies were able to completely block the binding of FITC - mSO.S 
to HL60 cells (note FIL = 1 is equivalent to no binding). 

35 

C) Competitk)n experiments 

As Olustrated in Figure 7, 1 (ig of FITC conjugated m60.3 and the indicated amount of either c60.3 or h60.3 
were coincu bated with ML60 cells for 45 minutes. The cells were washed, fbced with paraformakiehyde and 
40 assayed by FACS. As can be seen in Figure 7, the chimeric and humanized antit>ody competed equivalently 
in this assay. The dashed line shows binding of 1 ^g FITC - m60.3 in absence of competitor. 

3) Chemilluminescence Assay 

45 An assessmentof activity of 60.3 by inhibitk>n of Zymosan-induced, luminol-enhanced chemBuminescence 
of PMN was carried out The material and compositk>ns and procedures used were as follows. 

PrBpamnon of Components: 

so 1. GGVB 

1.1. Materials 

a) 5x Veronal: Dssdve 41 .2g NaCI and 5.095g 5,5-diethylbarfoiturate (Paragon B-2 buffer) in 700 ml deop- 
55 moz d(diH20). Adjust pH to 7.35 ;^ 0.05 with IN Ha. Bring volum to 1 1 iter with diH20. Filter steriliz and 

store at 4°C. StabOity but is at least 2 months. 

b) Stock m tals: mix equal volumes f 2M MgCIs (40.66g/1 00ml) and 0.3M CaQj (4.4g/1 00ml). FOter ster- 
ilize and store at 4''C. StabOity is at least 6 nrK>nths. 
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c) gelatin 

d) dextrose 

1.2. Procedure 

5 

For300ml6GVB: 

Add 0.3g gelatin and 0.3g dextrose to 240 ml diH20. Heat with mixing until just dissolved. Let cool to 
t>elow 37C. 

Add 60 ml 5x Veronal and 0.3 ml stock metals. Filter sterilize and store at 4C. 
10 This is made up fresh for each da/s assays. 

2.2. Zymosan 

2.1. Materials: 

IS 

a) Zymosan A, SigmaZ-4250 

b) diH20 

c) 1 x PBS 

20 2.2. Procedure: 

Suspend zynrwsan to 25 mg/ml (1.5g/60ml) In diH20. Heat in glass in a water bath at 100C for 60 min. 
Transfer to 50 ml polypropylene centrifuge tubes. Centrifuge (8500 rpm in the TJ-6, 10 min, RT) and 
wash twice with 1 x PBS. Resuspend to 50 mg/ml (1 .5g/30ml) in 1 x PBS. Store at 4C. Stability is at least 1- 
25 2 months at 4C. 

3. Complement (adsorbed human serum) 

3.1. Material: 

30 

a) Freshly drawn human biood, without anticoagulants. Type O may be best, or use the same donor that 
supplies PMN. Transfer blood immediately after drawing into sterile 50ml glass centrifuge tubes (avaOable 
from blood lab). 

b) 0.1 M EDTA. pH 7.35. Dissolve 3.72g NA2H2 EDTA in 80 ml di H20, with mixing. Adjust pH to 7.35 ± 
35 0.05 with freshly prepared 2N NaOH. Bring to 100 ml volume with diH20. 

c) Zymosan prep (see 2.1) 

3.2. Procedure: 

40 Allow blood to dot at room temperature for 1 hour. Rim the tube with a glass pipet to contract the dot 

Centrifuge (8500 RPM in TJ-6. 20 min., RT) and carefully remove serum to a sterile polypropylene tube. Repeat 
centifugation if necessary. 

Add 0. 1 M EDTA to serum to 1 0% mg/ml zymosan needed to adsorb serum at 0.2 wq zymosan/ml of serum. 
Add this volume of zymosan to each of four centrifuge tubes. Centrifuge zynK>san tubes (8500 rpm in the TJ- 
45 6, 10 min,. RT) and renrK>ve supernatant Keep the tubes on ice. 

Resuspend the zymosan pellet of one tube with a small anrwunt of serum. Add the reaminder of the 
serum and mix by inversion. 

Incubate the serum/zynmsan mixture on a rocker or rotator, 30 min, 4C. 
Centrifuge as atx>ve. Repeat the adsorption with the other three tubes. 
50 After the final centrifugation, fQterthe serum through a 0.45um syringe fitter Aliquot into microfuge tubes 

or equivalent and store at -70C. The adsort)ed serum has an unknown stability but it probably good for at least 
six nK)nths. 

4 Human PMN 

55 

4.1. Materials 

a) Freshly drawn h parinized blood. 

13 
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b) Rcoll-Hypaque Mono-Pdy Resolving Medium (MPRM), Row Labs no. 16-980- 49 
0) 1 X PBS 

d) 3% HoAc in diH20 

e) Serunvfree, phenol red-free RPMI with 0.5% gelatin (RPMI-g I): Add gelatin (1.25g/250 ml) to serum- 
5 free, ph nd red-free RPMI and h at with stirring until gelatin is just dissolved. Filter steriliz (0.45 um f Oter) 

and store in 50ml aliquots at 4C. Stability is least 2 weeks. 

4.2. Procedure 

10 Aliquot 4 ml MPRM into 1 5 ml polypropylene centrifuge tubes. 

Overlay gently with 5 ml fresh blood. 

Centrifuge (30 min, 2000 rpm in TJ-6, RT). If RBC are not completely pelleted, centrifuge an additional 
10 minutes. If no RBC sedimentation has occurred, additional centrifugation wOl not work and another donor 
must be found. 

IS Aspirate top (plasma) layer and first layer of cells (nK)nocytes). 

With a Pasteur pipet, carefully renxive the second cell layer (PMN) to a 50ml centrifuge tube. Repeat 
with all tubes. 

Add RT at room temperature. 1 x PBS to the PMN tube to a final volume of 50 ml and mbc gently. 

Remove 50ul and dilute appropriately in 3% HoAc (usually 1 :20 is appropriate for a prep of 30ml whole 
20 blood) and count PMN in a henrKxrytometer. Calculate total PMN present in the 50ml tube. 

Centrifuge PMN (10 min, 2000 rpm in TJ-6, RT) and resuspend to desired density (normally 2 x 10%nl) 
in RPMI-gel. Store at RT, swirling gently to resuspend occasionally (once or twice an hour). Viability should 
be > 90% after 6 hours; if used after 6-8 hours, check viability by trypan blue exclusion prior to use. 

25 5 Lumind 

5.1. Materials 

a) Luminol (5 amino-2,3 dihydro - 1,4 - phthalazinedione), Signrta no. A-8511. 
30 FW=172.2 

b) DMSO. chromatography grade 

5.2. Procedure 

35 Dissolve luminol to 10^ in DMSO (17.7 mg/IOml). Store at 4C in the dark (wrap tube in foil). StabOity 

is more than 1 month; make fresh 3-4 weeks. 

6. Antibody 

40 6. 1 . Materials and Procedure 

Dilute antibody in GGVB to appropriate concentrations. 
Assay Design 

45 

Test volumes per tube are normally 1 ml, composed of the following. 

- 100 ul antibody (sensitivity to concentration is unknown, previous work has t>een with 5 ug/ml preps) 

- 100 ul PMN (normal concentration at 2 x lO^/ml) 

- mbc by swirling gently and incubate 30 min. at room temperature (RT) (cover all tubes with one sheet of 
50 ParafDm) 

- 100 ul Zymosan (high concentrations, around 50mgAml.seem to work best) 

- ji^ before loading luminometer, add: 

- 600 ul luminol (dQuted in GGVB , normal concentratk)ns are lO-'^ or 5 x 10~ ^M) 

- 100 ul complement (k>w concentrations, on the order of 1-2%, seem to work better) 

55 

Staging of Assay 



Set up CL tubes (Clinicon 2174-089, avaOaU through LKB). Keep in the dark (in a drawer) to prevent spon- 
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taneous luminescence from absorbed fluorescent light 

Thaw complement at RT and hold on ice, just prior to assay complement may be unstabi (may lost 50% of 
activity in 6-8 hours) at 4C. 

Add antibody, PMN, and zymosan as described above to CL tubes. 
5 Tak tubes to luminometer. Program assay parameter into controlling computer. 

Prepare final dilution of Lumind (and keep wrapped in foil) just before adding to CL tut>es. 
Prepare final dilution of complement in GGVB. 

Add lumind and complement to CL tubes as described above and load into luminometer. Start the program 
immediately. Peak luminescence is reached 4-5 minutes after adding complement 
10 Throughout assay setup and during the first rotation of t he CL tubes In the luminometer, apply anti-static charge 
with the anti-static gun to prevent CL tubes hanging up in the luminometer. 

Results 

15 The above procedure was utilized to measure the ability of murine, chimeric, and humanized 60.3 to inhibit 
CR3(CDIIb/CD18) medk^ated up take of opsonized zyntosan by neutrophils. In the absence of 60.3. phagocytk: 
uptake of opsonized zynK>san results in an increase in hexase monophosphate shunt activity that is measured 
as light output by luminol-enhanced chemiluminescence. 
% inhibition of chemiluminescence signal = 100 x ( 1 - signal of sample/signal of negative control anti- 

20 body) 

The results shown in Figure 8 illustrate that all of the 60.3 antibodies were reactive in this assay, while an ir- 
relevant antibody (murine L6 anti-tumor antibody) showed no reactivity. 

EXAMPLE 4 

25 

CONSTRUCTION OF pGK.11 

In order to express either chimeric or humanized light chains, cassette vectors were constructed capable 
of expressing variable regkin genes, synthesized using PCR (polymerase chain reactk)n). In additk>n to se- 
30 quences found on pSV2-gpt these vectors contain an Ig promoter and leader, the 5' portion of an intron, a 
MCS for insertion of the variable regk>n gene, the mouse heavy chain enhancer (MHE), the 3' portion of an 
intron and the human Ck gene. The variable region was PCRed so that in addition to the coding region it con- 
tained flanking Intron sequences. 

The cassette for the expressk>n of the light chain was constructed as follows (and is illustrated in Rgura 

35 9): 

1. The 121 bp Hind III to Bgl II fragment of pSV2-gpt (Mulligan and Berg (1980) Science 1422) was 
deleted from the 5' end of the Ecogpt by restriction with Hind III and Bgl II, filling in with Klenow polymerase, 
and religation. The product, pG.2, was detected by the absence of Hind III and Bgl II sites and linearization 
with EcoR I. 

40 2. The pBR322 part of pG.2 (EcoR l-Pvu II fragment) was replaced with the analogous portion of pUC 1 8 
(Pvu II - Pvu II fragment) to form pG.3 This product was screened for by digestion with Pst I, which linearizes 
pG.3 giving a 5.05 kb fragment 

3. pG.3 was made into pG.5 by the replacement of the 750 bp EcoR I to BamH I fragment with the 64 bp 
EcoR I to BamH I multiple cloning site (MCS) from plC20R (Marsh et al. 1984). pG.5 was screened for the 

45 presence of an Xho I site (part of the MCS). 

4. A Not I site was inserted in the Nde I site of pG.5 to form pG.12. Oligonucleotide linkers (Sequence I.D. 
numbers 33 and 34) were used for this purpose. pG.12 was screened for the presence of a Not I site. 

5. A 2.75 kb EcoR I fragment containing the human Ck gene was inserted into the EcoR I site of the MCS 
of pG.12 to form pGk.3. When the Ck fragment was in the correct orientation, Sac I digestion produced 

so 0.126, 0.509 and 6.7 kb fragn^nts (vs. 0.509, 2.1 and 4.7 kb fragments in the wrong orientation). 

6. A 140 bp portion of the SV140 enhancer was removed from pG.3 by restriction with Sph I, destroying 
the overhang with the exonudease activity in Klenow polymerase, followed by digestion with Pvu II and 
blunt end ligation. The product, pG.9, was screened for the loss of Pvu II, Nsi I and Sph I sites. 

7. A Not I site was inserted in the Nde I site of pG.9 to form pG.10. Oligonucleotide linkers, described in 
55 #4 abov , were used for this purpose. pG.1 0 was screened for th presence of a Not I site. 

8. Th 195bpN 1 1 to BamH I fragm nt from pG. 12 was inserted into th N 1 1 to BamH I site of pG. 10 to 
form pG.11. This serv d to place an 879bp fragm nt from pG. 10 with a 195 base pair fragment containing 
a MCS. pG.11 was screened forth pres nee f Xho I site in the MCS regk>n. 
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9. The 3kb Nar I to Cla I fragment from pGk.3 was directionally subdon d into th sanre srtes of pG.11 to 
form pGk.4. 

1 0. A 1 kb fragment containing the nrKHise heavy chain nhancer was transferred from pICMH EXX to pGk.4 
as a eta I to Hind III fragment, thus forming pGk.5. EcoR I digestion of pGk.5. produced 0.3, 2.75 and 5.1 

5 kb fragments. ptCMHEXX was made by th insert! nofth IkbXbalfragm nt (filled in with Kl now poly- 

merase) from RBL 216 (Lang et al (1982) Nud. AckJ Res. 10: 611-620) into the filled in Bgl II site of pIC 
19 R (Marsh etal (1984) Gene 32: 481-486). 

11 . A 579 bp Sau3a I fragment containing the 4B9 promoter pGkAl .9 (Raff et al, 1991) was inserted into the 
BamH I site (in the MCS) of pGk.5 to form pGk.11. The resulting plasnrttds were screened for the correct ori- 

10 entatkm of the insert : BamH I plus Asp718 1 digestkm gave a 1.2 kb fragment in the correct orientatbn vs. a 
2.2 kb fragment in the wrong orientation. Also, BamH I plus Hind III dige^n shouM give a 0.58 kb fragment 
The sequence of specif ic regions is contained in the following segments. The sequence Is given in dock- 
wise orientation beginning at the EcoR I site at O"" and is illustrated in Figure 10 and Sequence I.D. 10. 
The ampidllin resistance gene is bp 7383 to 8241 
15 The ecogpt gene is bp 5651 to 61 07 

The mouse heavy chain enhancer is bp 2770 to 3788 

The SauSa I fragment containing the Alk promoter and leader is bp 3827 to 4393, with the leader peptkie en- 
coded by bp 3951 to 3999. 

The EcoR I fragment containing the human Ck gene is bp 6 to 2756, with the Ck regk)n itself encoded by bp 
20 2113 to 2435. 

EXAMPLE 5 

Construction of pNy1.16 

25 

In order to express either chimeric or humanized heavy chains, cassette vectors were constructed capable 
of expressing variable region genes synthesized using PGR (pdymerase chain reactbn). In addition to se- 
quences found on pSV2-neo, these vectors contain an Ig promoter and leader, the 5* portk>n of an intron, a 
MCS for insertion of the variable regton gene, the moue heavy chain enhancer (MHE), the 3' portion of an 
30 intron and the human G yl gene. The variable region gene is PCRed so that in addition to the coding region 
it contains flanking intron sequences. 

The cassette for the expression of the heavy chain was constructed as follows (as illustrated in Figure 11) 
1 . The Hind III site in pSV2-neo was removed by digestion with Hind III, fill in with Klenow polymerase and 
religation. The product, pN.1 was screened for the absence of the Hind ill site. 
35 2. The 750 bp EcoR I to BamH I fragment from pN.1 was replaced by a 64 bp EcoR I to BamH I multiple 
doning site (MCS) from plC20R to form pN.5 

3. A PGR region was done on pICMHEXX in a manner that primers were chosen to delete the EcoR I site 
while generating a new MCS regton. This resulted in a product consisting of: a) recognition sequences for 
EcoR V. Asp 718 1, Sac I and Xho I; b) a 695 bp of the rTK>use heavy chain enhancer from the 5'Xt>a I site 

40 to the EcoR I site; and c) recognition sequences for Hind III, Sal I and BamH I. This 723 bp PGR product 
was doned into p1G20R to form pMHE.per. 

4. The 723 EcoR V - BamH I fragment from pMHE.per was subdoned into the same sites in the MCS region 
of pN.5. This removed the previous MCS, while inserting the one assodated with the MHE. The product, 
pN.8, was screened for the presence of 0.7 and 5.0 kb Xho I - Hind III fragments and for linearization to 

45 5.7 kb with EcoR I. 

5. Two PGR SOEing (HcHion et al, (1990) Biotech. 8 : 528-535) reactions were used to create several mu- 
tations in the L6 heavy chain promoter. Outer primers had enzynne sites EcoR V and Sad for sut>doning 
into plG20R to form pMUTL6HCP. The sequence of this insert is shown as bp 7793-8495 in Fig. 12 and 
in Sequence I.D. numt>er 9. 

50 6. The 703 bp EcoR V to Sad fragment from pMUTL6HCP was inserted into the same sites of pN.8 to 
form pN.9. 

7. A 3.5 kb Xho I - BamH I fragment from phtyl A2.5, conteining the PCRed MHEXR (Sequence I.D. Number 
31) plus the 2.8 kb Hindm - BamH I fragment encoding the human yl gene (Sequence I.D. Number 32) 
was inserted into the same sites of pN.9 to form pN/l .16. The sequence of this insert is shown as bp 2- 
55 2799 in Fig. 12 and in Sequence I.D. Number 9. 

Th foregoing description and th Examples are intended as illustrativ and are not to be tak n as limiting. 

StiD thervariations within th spirit and scope of this invention are possible and will readily present themselves 

to those skOled in the art 
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SEQUENCE LISTING 



(1) GENERAL INFORHATION; 



(i) APPLICANT: 

(A) NAME: Bristol-Myers Squil3i> Company 

(B) STREET: 345 Park Avenue 

(C) CITY: New York 

(D) STATE OR PROVINCE: New York 
(B) COUNTRY: USA 

(P) POSTAL CODE: 10154 
15 (G) TELEPHONE: 206-728-4800 

(H) PAX: 206-727-3601 

(ii) TITLE OF THE INVENTION: HUHAHIZED MONOCLONAL ANTIBODIES 

(ill) NUMBER OF SEQUENCES 34 

^ (iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

25 (v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: EP93401328.5 

(B) FILING DATE: 24-MAY-1993 

(C) CLASSIFICATION: 



(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US07/888233 

(B) FILING DATE: 26-MAY-1992 

(C) CLASSIFICATION: 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CAGGTCCAAC TTGTCCAGTC CGGTGCCGAA GTTAAGAAGC CTGGCGCTTC TGTGAAGGTC 60 

45 TCCTGCAAGG CTTCTGGCTA CACCTTCACC GACTACTGGA TGAACTGG6T TCGACAGGCA 120 

CCTGGACAGG GCCTAGAGTG GATGGGAAGG ATOGATCCTT CCGATAGTGA AACTCACTAC 180 

AATCAGAAGT TCCAGGGTAG GGTAACAATG ACCCGAGACA CATCCACCAG CACAGTCTAC 240 

ATGGAACTCA GCAGCCTGCG ATCTGAGGAC ACCGCAGTCT ATTACTGTGC ACGAGGTGGA 300 

CGGCTCGGTT CCTTTGCTAT GGACTACTGG GGTCAAGGCA CCCTCGTCAC CGTCTCCTCA 360 



55 
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G 

(2) IKFORHATION FOR SEQ ID HO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

iS FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
^ 1 5 10 15 

Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Ttir Asp Tyr 
20 25 30 

Trp Met Asn Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp Met 
25 35 40 45 

Gly Arg He Asp Pro Ser Asp Ser Glu Thr His Tyr Asn Gin Lys Phe 
50 55 60 

Gin Gly Arg Val Thr Met Thr Arg Asp Thr Ser Thr Ser Thr Val Tyr 

^ 70 75 80 

30 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Gly Gly Arg Leu Gly Ser Phe Ala Met Asp Tyr Trp Gly Gin 
100 105 110 

Gly Thr Leu Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



45 



(ii) MOLECULE TYPE: DNA (genomic) 



361 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAAATTGTGC TAACACAATC TCCAGCTACA rPGTCTTTGT CTCCAGGTGA GAGAGCCACT 60 
^ CTATCCTGCA GAGCCA6T6A AAGTGTTGAT AGTTATGGCA ATAGTTTTAT GCACTGGTAC 120 



55 
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CAGCAGAAAC CAGGACAGGC ACCAAGGCTC CTCATCTATC GTGCATCCAA CCTAGAAACT 180 

GGTATCCCTG CCAGGTTCAG TGGCAGTGGT TCTAGGACAG ACTTCACTCT CACCTATTCT 240 

TCTCTAGAGC CTGAAGATTT TGCAGTGTAT TACTGTCAGC AAAGTAATGA GGATCCTCGG 300 

ACX3TTCGGTG GAGGCACCAA GGT6GAAGAG AAAC 334 

(2) INFORHATIOM FOR SEQ ID NO: 4: 

(i) SBQUEMCE CHARACTERISTICS: 
15 (A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(V) PRACTIEHT TYPE: internal 

(Xi) SEQUEKCE DESCRIPTION: SEQ ID N0:4: 



20 



Glu lie Val Leu Thr Gin Ser Pro Ala Thr Leu Ser Leu Ser Pro Gly 
15 10 15 

Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Glu Ser Val Asp Ser Tyr 
20 25 30 

Gly Asn Ser Phe Met His Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro 
35 40 45 

Arg Leu Leu lie Tyr Arg Ala Ser Asn Leu Glu Thr Gly lie Pro Ala 
50 55 60 

Arg Phe Ser Gly Ser Gly Ser Arg Thr Asp Phe Thr Leu Thr Tyr Ser 
65 70 75 80 

Ser Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr cys Gin Gin Ser Asn 
85 90 95 

Glu Asp Pro Arg Thr Phe Gly Gly Gly Thr Lys Val Glu Glu Lys 
100 105 110 

^ (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



35 



45 



(ii) MOLECULE TYPE: DMA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
50 CAGGTCCAAC TTCAGCAGCC TGGGCCT6AC CTTCTGAAGC CTGCGGCTCC AGTGAAGCTG 60 



55 
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10 



15 



20 



25 



Gin Val Gin Leu Gin Gin Pro Gly Pro Asp Leu Leu Lys Pro Gly Ala 
^5 10 15 

Pro Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asp Tyr 

30 20 25 30 

Trp Met Asn Trp Val Lys Gin Arg Pro Gly Arg Gly Leu Glu Trp He 

35 40 45 



35 



40 



Gly Arg He Asp Pro Ser Asp Ser Glu Thr His Tyr Asn Gin Lys Phe 
50 55 60 

Lys Asp Lys Ala Thr Leu Thr Val Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 

He Gin Leu ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Gly Gly Arg Leu Gly Ser Phe Ala Met Asp Tyr Trp Gly Gin 
100 105 110 

Gly Thr Ser Val Thr Val Ser Ser 
115 120 

45 (2) IHFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: DMA (genomic) 



361 



TCCTGCAAGG CTTCTGGCTA CACCTTCACC GACTACTGGA TGAACTGGGT TAAGCAGAGG 120 

CCTGGACGAG GCCTCGAGTG GATTGGAAGG ATTGATCCTT CCGATAGTGA AACTCACTAC 180 

AATCAGAAGT TCAAGGACAA GGCCACACTG ACTGTAGACA AATCCTCCAG CACAGCCTAC 240 

ATCCAACTCA GCAGCCTGAC ATCTGAGGAC TCTGCAGTCT ATTACTGTGC ACGAGGGGGA 300 

CGGCTCGGGT CCTTTGCTAT GGACTACTGG GGTCAAGGCA CCTCAGTCAC CGTCTCCTCA 360 
G 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: internal 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

10 

GACATTGTGC TAACACAATC TCCAGCTTCT TTGGCTGTGT CTCTAGGGCA GAGGGCCACC 60 
ATATCCTGCA GAGCCAGTGA AAGTGTTGAT AGTTATGGCA ATAGTTTTAT GCACTGGTAC 120 
CA6CAGAAAC CAGGACAGCC ACCCAAACTC CTCATCTATC GTGCATCCAA CCTAGAATCT 180 
t5 GGGATCCCTG CCAGGTTCAG TGGCAGTGGG TCTAGGACAG ACTTCACCCT CACCATTAAT 240 

CCTGTGGAGG CTGATGATGT TGCAACCTAT TACTGTCAGC AAAGTAATGA GGATCCTCGG 300 
ACGTTCGGTG GAG6CACCAA GCTGGAAATC AAAC 334 
(2) INFORMATION FOR SEQ ID NO:B: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (ii) HOLECULE TYPE: peptide 

(V) FRAC9IENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

30 

Asp lie Val Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Leu Gly 
15 10 15 

Gin Arg Ala Thr He Ser Cys Arg Ala Ser Glu Ser Val Asp Ser Tyr 
20 25 30 

^ Gly Asn Ser Phe Met His Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro 

35 40 45 

Lys Leu Leu He Tyr Arg Ala Ser Asn Leu Glu Ser Gly He Pro Ala 
50 55 60 

^ Arg Phe Ser Gly Ser Gly ser Arg Thr Asp Phe Thr Leu Thr- He Asn 

65 70 75 80 

Pro Val Glu Ala Asp Asp Val Ala Thr Tyr Tyr Cys Gin Gin Ser Asn 
85 90 95 

Glu Asp Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
45 100 105 110 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9201 base pairs 

(B) TYPE: nucleic acid 
^ (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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5 

(ii) MOLECULE TYPE: DNA (genomic) 

10 

(xi) SEQUENCE DESCRIPTIW: SEQ ID NO: 9: 

AAGCTTTCTG GGGCAGGCCA GGCCTGACCT TGGCTTTGGG GCAGGGAGGG GGCTAAGGTG 60 

AGGCAGGTGG CGCCAGCCAG GTGCACACCC AATGCCCATG AGCCCAGACA CTGGACGCTG 120 

15 AACCTCGCGG ACAGTTAAGA ACCCAGGGGC CTCTGCGCCC TGGGCCCAGC TCTGTCCCAC 180 

ACCGCGGTCA CATGGCACCA CCTCTCTTGC AGCCTCCACC AAGGGCCCAT CGGTCTTCCC 240 

CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGGCT GCCTGGTCAA 300 

_ GGACTACTTC CCCGAACCGG TGACGGTGTC GTCGAACTCA GGCGCCCTGA CCAGCGGC6T 360 
20 

GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC TCCCTCAGCA GCGTGGTGAC 420 

CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC AACGTGAATC ACAAGCCCAG 480 

CAACACCAAG 6TGGACAAAC GCGTTGGT6A GAGGCCAGCA CAGGGAGGGA GGGTGTCTGC 540 

^ TGGAAGCCAG GCTCAGCGCT CCTGCCTGGA C6CATCCCGG CTATGCAGCC CCAGTCCAGG 600 

GCAGCAAGGC AGGCCCCGTC TGCCTCTTCA CCCGGAGGCC TCTGCCCGCC CCACTCATGC 660 

TCAGGGAGAG GGTCTTCTGG CTTTTTCCCC A6GCTCT6GG CA6GCACAGG CTAGGTGCCC 720 

30 CTAACCCAGG CCCTGCACAC AAAGGGGCAG GTGCTGGGCT CAGACCTGCC AAGAGCCATA 780 

TCCGGGAGGA CCCTGCCCCT GACCTAAGCC CACCCCAAAG GCCAAACTCT CCACTCCCTC 840 

AGCTCGGACA CCTTCTCTCC TCCCAGATTC CAGTAACTCC CAATCTTCTC TCTGCAGAGC 900 

^ CCAAATCTTG TGACAAAACT CACACATGCC CACCGTGCCC AGGTAAGCCA GCCCAGGCCT 960 

CGCCCTCCAG CTCAAGGCGG GACAGGTGCC CTAGAGTAGC CTGCATCCAG GGACAGGCCC 1020 

CAGCCGGGTG CTGACACGTC CACCTCCATC TCTTCCTCAG CACCTGAACT CCTGGGGGGA 1080 

CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 1140 

40 

GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 1200 

TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 1260 

AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 1320 

45 GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 1380 

AAAGCCAAAG GTGGGACCCG TGGGGTGCGA GGGCCACATG GACAGAGGCC GGCTCGGCCC 1440 

ACCCTCTGCC CTGAGAGTGA CCGCTGTACC AACCTCTGTC CCTACAGGGC AGCCCCGAGA 1500 

ACCACAGGTG TACACCCTGC CCCCATCTAG AGAGGAGATG ACCAAGAACC AGGTCAGCCT 1560 

GACCTGCCTG 6TCAAAGGCT TCTATCCCAG CGACATCGCC GTGGAGTGGG AGAGCAATG6 1620 

55 
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10 



20 



25 



GCAGCCGGAG AACAACTACA AGACCACGCC TCCCGTGCTG GACTCCGACG GCTCCTTCTT 1680 

CCTCTACAGC AAGCTCACCG TGGACAAGAG CAGGTGGCAG CAGGGGAACG TCTTCTCATG 1740 

CTCCGTGATG CATGAGGCTC TGCACAACCA CTACACGCAG AAGAGCCTCT CCCTGTCTCC 1800 

GGGTAAATGA GTGCGACGGC CGGCAAGCCC CCGCTCCCCG GGCTCTCGCG GTCGCACGAG 1860 

GATGCTTGGC ACGTACCCCC TGTACATACT TCCCGGGCGC CCAGCATGGA AATAAAGCAC 1920 

15 CCAGCGCTGC CCTGGGCCCC TGCGAGACTG TGATGGTTCT TTCCACGGGT CAGGCCGAGT 1980 

CTGAGGCCTG AGTGGCATGA GGGAGGCAGA GCGGGTCCCA CTGTCCCCAC ACTGGCCCAG 2040 

GCTGTGCAGG TGTGCCTGGG CCCCCTAGGG TCGGGCTCAG CCAGGGGCTG CCCTCGGCAG 2100 

GGTGGGGGAT TTGCCAGCGT GGCCCTCCCT CCAGCAGCAC CTGCCCTGGG CTCGGCCACG 2160 

GGAAGCCCTA GGAGCCCCTG GGGACAGACA CACAGCCCCT GCCTCTGTAG GAGACTGTCC 2220 

TGTTCTGTGA GCGCCCCTGT CCTCCCGACC TCCATGCCCA CTCGGGGGCA TGCCTAGTCC 2280 

ATGTGCGTAG GGACAGGCCX: TCCCTCACCC ATCTACCCCC ACGGCACTAA CCCCTGGCTG 2340 

CCCTGCXrCAG CCTCGCACCC GCATGG66AC ACAACCGACT CCGGGGACAT GCACTCTCGG 2400 

GCCCTGTCGA GGGACTGGTG CAGATGCCCA CACACACACT CAGCCCAGAC CCGTTCAACA 2460 

AACCCCGCAC TGAGGTTGGC CGGCCACACG GCCACCACAC ACACACGTGC ACGCCTCACA 2520 

30 CACGGAGCCT CACCCGGGCG AACTGCACAG CACCCAGACC AGAGCAAGGT CCTCGCACAC 2580 

GTGAACACTC CTCGGACACA GGCCCCCACG AGCCCCACGC GGCACCTCAA GGCCCACGAG 2640 

CCTCTCGGCA GCTTCTCCAC ATGCTGACCT GCTCAGACAA ACCCAGCCCT CCTCTCACAA 2700 

GGGTGCCCCT GCAGCCGCCA CACACACACA GGGGATCACA CACCACGTCA CGTCCCTGGC 2760 

CCTGGCCCAC TTCCCAGTGC CGCCCTTCCC TGCAGGACGG ATCCAGACAT GATAAGATAC 2820 

ATTGATGAGT TTGGACAAAC CACAACTAGA ATGGAGTGAA AAAAATGCFT TATTTGTGAA 2880 

ATTTGTGATG CTATTGCTTT ATTTGTAACC ATTATAAGCT GCAATAAACA AGTTAACAAC 2940 

AACAATTGCA TTCATTTTAT GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT TTTTTAAAGC 3000 

AAGTAAAACC TCTACAAATG TGGTATGGCT GATTATGATC TCTAGTCAAG GCACTATACA 3060 

TCAAATATTC CTTATTAACC CCTTTACAAA TTAAAAAGCT AAAGGTACAC AATTTTTGAG 3120 

45 CATAGTTATT AATAGCAGAC ACTCTATGCC TGTGTGGAGT AAGAAAAAAC AGTATGTTAT 3180 

GATTATAACT GTTATGCCTA CTTATAAAGG TTACAGAATA TTTTTCCATA ATTTTCTTGT 3240 

ATAGCAGTGC AGCTTTTTCC TTTGTGGTGT AAATAGCAAA GCAAGCAAGA GTTCTATTAC 3300 

TAAACACAGC ATGACTCAAA AAACTTAGCA ATTCTGAAGG AAAGTCCTTG GGGTCTTCTA 3360 

CCTTTCTCTT CTTTTTTGGA GGAGTAGAAT GTTGAGAGTC AGCAGTAGCC TCATCATCAC 3420 
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TAGATGGCAT TTCTTCTGAG CAAAACAGGT TTTCCTCATT AAAGGCATTC CACCACTGCT 3480 

CCCATTCATC AGTTCCATAG GTTGGAATCT AAAATACACA AACAATTAGA ATCAGTAGTT 3540 

10 

TAACACATTA TACACTTAAA AATTTTATAT TTACCTTAGA GCTTTAAATC TCTGTAGGTA 3600 

GTTTGTCCAA TTATGTCACA CCACAGAAGT AAGGTTCCTT CACAAA6ATC CGGGACCAAA 3660 

GCGGCCATCG TGCCTCCCCA CTCCTGCAGT TCGGG6GCAT GGATGCGCGG ATAGCCGCTG 3720 

15 CTGGTTTCCT GGATGCC6AC GGATTTGCAC TGCCGGTAGA ACTCCGCGAG GTCGTCCAGC 3780 

CTCAGGCAGC AGCTGAACCA ACTCGCGAGG GGATCGAGCC C6GGGTGGGC GAAGAACTCC 3840 

AGCATGAGAT CCCCGCGCTG GAGGATCATC CAGCCGGCGT CCCGGAAAAC GATTCCGAAG 3900 

_ CCCAACCTTT CATAGAAGGC GGCGGTGGAA TCGAAATCTC GTGATGGCAG GTTGGGCGTC 3960 
20 

GCTTGGTCGG TCATTTCGAA CCCCAGAGTC CCGCTCAGAA GAACTCGTCA AGAAGGCGAT 4020 

AGAAGGCGAT GCGCTGCGAA TCGGGAGCGG CGATACCGTA AAGCACGAGG AAGCGGTCAG 4080 

CCCATTCGCC GCCAAGCTCT TCAGCAATAT CACGGGTAGC CAACGCTATG TCCTGATAGC 4140 

25 

GGTCCGCCAC ACCCAGCCGG CCACAGTCGA TGAATCCAGA AAAGCGGCCA TTTTCCACCA 4200 

TGATATTCGG CAAGCAGGCA TCGCCATGGG TCACGACGAG ATCCTCGCCG TCGGGCATGC 4260 

GCGCCTTGAG CCT6GCGAAC AGTTCGGCTG GCGCGAGCCC CTGATGCTCT TCGTCCAGAT 4320 

30 CATCCTGATC GACAAGACCG GCTTCCATCC GAGTACGTGC TCGCTCGATG CGATGTTTCG 4380 

CTTGGTGGTC GAAT6GGCAG GTAGCCGGAT CAAGCGTATG CAGCCGCCGC ATTGCATCAG 4440 

CCATGATGGA TACTTTCTCG GCAGGAGCAA GGTGAGATGA CAGGAGATCC TGCCCCGGCA 4500 

CTTCGCCCAA TAGCAGCCAG TCCCTTCCCG CTTCAGTGAC AACGTCGA6C ACAGCTGCGC 4560 

35 

AAGGAACGCC CGTCGTGGCC AGCCACGATA GCCGCGCTGC CTCGTCCTGC AGTTCATTCA 4620 

GGGCACCGGA CAGGTCGGTC TTGACAAAAA GAACCGGGC6 CCCCTGCGCT 6ACAGCCGGA 4680 

ACACGGCGGC ATCAGAGCAG CCGATTGTCT GTTGTGCCCA GTCATAGCCG AATAGCCTCT 4740 

40 

CCACCCAAGC 66CCGGAGAA CCTGCGTGCA ATCCATCTTG TTCAATCATG CGAAACGATC 4800 

CTCATCCTGT CTCTTGATCA GATCTTGATC CCCTGCGCCA TCAGATCCTT GGCGGCAAGA 4660 

AAGCCATCCA GTTTACTTTG CAGGGCTTCC CAACCTTACC AGAGGGCGCC CCAGCTGGCA 4920 

45 ATTCCGGTTC GCTTGCTGTC CATAAAACCG CCCAGTCTAG CTATCGCCAT GTAAGCCCAC 4980 

TGCAAGCTAC CTGCTTTCTC TTTGCGCTTG CGTTTTCCCT TGTCCAGATA GCCCAGTAGC 5040 

TGACATTCAT CXZGGGGTCAG CACCGTTTCT GCGGACTGGC TTTCTACGTG TTCCGCTTCC 5100 

TTTAGCAGCC CTTGCGCCCT GAGTGCTTGC GGCAGCGTGA AGCTAGCTTT TTGCAAAAGC 5160 

50 

CTAGGCCTCC AAAAAAGCCT CCTCACTACT TCTGGAATAG CTCAGAGGCX: GAGGCGGCCT 5220 

55 
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CGGCCTCTGC 


ATAAATAAAA 


AAAATTAGTC 


AGCCATGGGG 


CGGAGAATGG 


GGCGGGATGG 


5280 




GCGGAGTTAG 


GGCGGAACTG 


GGCGGAGTTA 


GGGGCX^AC 


TATGGTTGCT 


GACTAATTGA 


5340 


10 


GATGCATGCT 


TTGCATACTT 


CTGCCTGCTG 


GGGAGCCTGG 


GGACTTTCCA 


CACCTGGTTG 


5400 




CTGACTAATT 


GAGATGCATG 


CTTTGCATAC 


TTCTGCCTGC 


TGGGGAGCCT 


GGGGACTTTC 


5460 




CACACCCTAA 


CTGACACACA 


TTCCACAGCT 


GCCTCGCGCG 


TTTCGGTGAT 


GACGGTGAAA 


5520 


IS 


ACCTCTGACA 


CATGCAGCTC 


CCGGAGACGG 


TCACAGCTTG 


TCTGTAAGCG 


GATGCCGGGA 


5580 




GCAGACAAGC 


CCGTCAGGGC 


GCGTCAGCGG 


GTGTTGGCGG 


GTGTCGGGGC 


GCAGCCATGA 


5640 




CCCAGTCACG 


TAGCGATAGC 


GGAGTGTATA 


CTGGCTTAAC 


TATGCGGCAT 


CAGAGCAGAT 


5700 


20 


TGTACTGAGA 


GTGCACCATA 


TGCGGTGTGA 


AATACCXKIAC 


AGAT6CGTAA 


GGAGAAAATA 


5760 




CCGCATCAGG 


CGCTCTTCCG 


CTTCCTCGCT 


CACTGACTCG 


CTGCGCTCGG 


TCGTTCGGCT 


5820 




GCGGCGAGCG 


GTATCAGCTC 


ACTCAAAGGC 


GGTAATACGG 


TTATCCACAG 


AATCAGGGGA 


5880 


25 


TAACGCAGGA 


AA6AACATGT 


GAGCAAAAGG 


CCAGCAAAA6 


GCCAGGAACC 


GTAAAAAGGC 


5940 


CGCGTTGCTG 


GCGTTOTTCC 


ATAGGCTCCG 


CCCCCCTGAC 


GAGCATCACA 


AAAATCGACX; 


6000 




CTCAAGTCAG 


AGGTGGCGAA 


ACCCGACAGG 


ACTATAAAGA 


TACCAGGCGT 


TTCCCCCTGG 


6060 




AAGCTCCCTC 


GTGCGCTCTC 


CTGTTCCGAC 


CCTGCCGCTT 


ACCGGATACC 


TGTCCGCCTT 


6120 


30 


TCTCCCTTCG 


GGAA6CGT6G 


CGCTTTCTCA 


TAGCTCACGC 


TGTAGGTATC 


TCAGTTCGGT 


6180 




GTAGGTCGTT 


CGCTCCAAGC 


TGGGCTGTGT 


GCACGAACCC 


CCXXJTTCAGC 


CCGACCGCTG 


6240 




CGCCTTATCC 


GGTAACTATC 


GTCTTGAGTC 


CAACCCGGTA 


AGACACGACT 


TATCGCCACT 


6300 


35 


GGCAGCAGCC 


ACTGGTAACA 


GGATTAGCAG 


AGCGAGGTAT 


6TAGGCGGTG 


CTACA6AGTT 


6360 




CTTGAAGTGG 


TGGCCTAACT 


ACGGCTACAC 


TAGAAGGACA 


GTATTTGGTA 


TCTGCGCTCT 


6420 




GCTGAAGCCA 


GTTACCTTCG 


GAAAAAGAGT 


TGGTAGCTCT 


TGATCCGGCA 


AACAAACCAC 


6480 




CGCTGGTAGC 


GGTGGTTTTT 


TTGTTTGCAA 


GCAGCAGATT 


ACGCGCAGAA 


AAAAAGGATC 


6540 


40 


TCAAGAAGAT 


CCTTTGATCT 


TTTCTACGGG 


GTCTGACGCT 


CAGTGGAACG 


AAAACTCACG 


6600 




TTAAGGGATT 


TTGGTCATGA 


GATTATCAAA 


AAGGATCTTC 


ACCTAGATCC 


TTTTAAATTA 


6660 




AAAATGAAGT 


TTTAAATCAA 


TCTAAAGTAT 


ATATGAGTAA 


ACTTGGTCTG 


ACAGTTACCA 


6720 


45 


ATGCTTAATC 


AGTGAGGCAC 


CTATCTCAGC 


GATCTGTCTA 


TTTCGTTCAT 


CCATAGTTGC 


6780 




CTGACTCCCC 


GTCGTGTAGA 


TAACTACGAT 


ACGGGAGGGC 


TTACCATCTG 


GCCCCAGTGC 


6840 




TGCAATGATA 


CCGCGAGACC 


CACGCTCACC 


GGCTCCAGAT 


TTATCAGCAA 


TAAACXAGCC 


6900 


50 


AGCCGGAAGG 


GCCGAGCGCA 


GAAGTGGTCX: 


TGCAACTTTA 


TCCGCCTCCA 


TCCAGTCTAT 


6960 




TAATTGTTGC 


CGGGAAGCTA 


GAGTAAGTAG 


TTCGCCAGTT 


AATAGTTTGC 


GCAACGl^rGT 


7020 
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40 



TGCCATTGCT 


GCAGGCATCG 


TGGTGTCACG 


CTCGTCGTTT 


GGTATGGCTT 


CATTCAGCTC 


7080 


CGGTTCCCAA 


CGATCAAGGC 


GAGTTACATG 


ATCCCCCATG 


TTGTGCAAAA 


AAGCGGTTAG 


7140 


CTCCTTCGGT 


CCTCCGATCG 


TTGTCAGAAG 


TAAGTTGGCC 


GCAGTGTTAT 


CACTCATGGT 


7200 


TATGGCAGCA 


CTGCATAATT 


CTCTTACTGT 


CATGCCATCC 


GTAAGATGCT 


TTTCTGTGAC 


7260 


TGGTGAGTAC 


TCAACCAAGT 


CATTCTGAGA 


ATAGTGTATG 


CGGCGACCGA 


GTTGCTCTTG 


7320 


CCCGGCGTCA 


ACACGGGATA 


ATACCGCGCC 


ACATAGCAGA 


ACTTTAAAAG 


TGCTCATCAT 


7380 


TGGAAAAC6T 


TCTTCGGGGC 


GAAAACTCTC 


AAGGATCTTA 


CCGCTGTTGA 


GATCCAGTTC 


7440 


GATGTAACCC 


ACTCGTGCAC 


CCAACTGATC 


TTCAGCATCT 


TTTACTTTCA 


CCAGCGTTTC 


7500 


TGGGTGAGCA 


AAAACAGGAA 


GGCAAAATGC 


CGCAAAAAAG 


GGAATAAGGG 


CGACACGGAA 


7560 


ATGTTGAATA 


CTCATACTCT 


TCCTTTTTCA 


ATATTATTGA 


AGCATTTATC 


AGGGTTATTG 


7620 


TCTCATGAGC 


6GATACATAT 


TTGAATGTAT 


TTAGAAAAAT 


AAACAAATAG 


GGGTTCCGCG 


7680 


CACATTTCCC 


CGAAAA6TGC 


CACCTGACGT 


CTAAGAAACC 


ATTATTATCA 


TGACATTAAC 


7740 


CTATAAAAAT 


AOGCGTATCA 


CGAGGCCCTT 


TCGTCTTCAA 


GAATTCATCG 


ATATCGGAAA 


7800 


AT6AAAAAAA 


ATATTTTTTA 


ATTTTAAAAT 


GAAATGTTTA 


TOTTCAATTT 


CTCCAAATTT 


7860 


CACAAGGAAA 


GATTAGTCAC 


GGGTATGGGA 


GAGCAGAGGA 


CCATAAGAGT 


TCAGGAATAG 


7920 


AATCCATTAT 


GATTCTGGAG 


TCAAGGAAGT 


ACTGATGCCA 


AGGTTTCAGT 


ATAAGAGCAG 


7980 


TATCCACTGG 


AAAGGATAAA 


GTCACTACAA 


CTGAGCACAG 


AGCAGGACAG 


CTACCTAATG 


8040 


AGT6GTCACT 


AATGGGCCAC 


TGTTACACTG 


TTATACGGCT 


TAGGAATGAG 


CACTGAGGCT 


8100 


GTGAGGT6TA 


TGGGTAAGGA 


CATCAGGATG 


TAAACCCA6C 


TCAGGTAGAG 


GACTCAGAGC 


8160 


ACAGGACAAT 


CAGCACGAAC 


TAATAAACAA 


CAGATAAGAT 


AAGGCACAAG 


CTCAGCAATA 


8220 


TTGGATCAGG 


GATCTTTGTA 


AATCTGACTG 


TGTATTCAGT 


CTAGTTCAAT 


GTGACTCATG 


8280 


AAGCCCACCC 


ATATGCAAAT 


CTAGAGAAGA 


crrrAGAGTA 


TAAATCTGAG 


GCTCACCTCA 


8340 


CATACCAGCA 


AGGGAGTGAC 


CAGCTTGTCT 


TAAGGCACCA 


CPGAGCCCAA 


GTCTTAGACA 


8400 


TCATGGATTG 


GCTGTGGAAC 


TTGCTATTCC 


TGATGGCAGC 


TGCCCAAGGT 


AAGTCATCAG 


8460 


AAAAAAGAGT 


TCGAAGGGAA 


ATTGAAGCAG 


TTCCGAGCTC 


GGTACCCTCG 


AGATCCTAGA 


8520 


GAGGTCTGGT 


GGAGCCTGCA 


AAAGTCCAGC 


TTTCAAAGGA 


ACACAGAAGT 


ATGTGTATGG 


8580 


AATATTAGAA 


GATGTTGCTT 


TTACTCTTAA 


GTTGGTTCCT 


AGGAAAAATA 


GTTAAATACT 


8640 


GTGACTTTAA 


AATGTGAGAG 


GGTTTTCAAG 


TACTCATTTT 


TTTAAATGTC 


CAAAATTTTT 


8700 


GTCAATCAAT 


TTGAGGTCTT 


GTTTGTGTAG 


AACTGACATT 


ACTTAAAGTT 


TAACCGAGGA 


8760 


ATGGGAGTGA 


GGCTCTCTCA 


TACCCTATCC 


AGAACTGACT 


TTTAACAATA 


ATAAATTAAG 


8820 
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TTTAAAATAT TTTTAAATGA ATTGAGCAAT GTTGAGTTGA 
CCAGAACACC TGCAGCAGCT GGCAGGAAGC AGGTCATGTG 
GAAAATAAAA CCACTAGGTA AACTTGTAGC TGTGGTTTGA 
CTGTCCAGCC CCACCAAACC GAAAGTCCAG GCTGAGCAAA 
ATTTCTAAAA TAAGTTGAGG ATTCAGCCGA AACTGGAGAG 
GTTCAACCTT TTAATTTTAG CTTGAGTAGT TCTAGTTTCC 
TTCTAAAATG TATTTAGAAT T 
(2) INPORHATION FOR SEQ ID NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



GTCAAGATGG 
GCAAGGCTAT 
AGAAGTGGTT 
ACACCACCTG 
GTCCTCTTTT 
CCAAACTTAA 



CCGATCAGAA 
TTGGGGAAGG 
TTGAAACACT 
GGTAATTTGC 
AACTTATTGA 
GTTTATCGAC 



88B0 
8940 
9000 
9060 
9120 
9180 
9201 



30 



35 



40 



45 



SO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

GGATCCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA GAATGCAGTG 60 

AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA CCATTATAAG 120 

CTGCAATAAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG TTCAGGGGGA 180 

GGTGTGGGA6 GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTATGG CT6ATTATGA 240 

TCTCTAGTCA AGGCACTATA CATCAAATAT TCCTTATTAA CCCCTTTACA AATTAAAAAG 300 

CTAAAGGTAC ACAATTTTTG AGCATAGTTA TTAATAGCAG ACACTCTATG CCTGTGTGGA 360 

GTAAGAAAAA ACAGTATGTT ATGATTATAA CTGTTATGCC TACTTATAAA GGTTACAGAA 420 

TATTTTTCCA TAATTTTCTT GTATAGCAGT GCAGCTTTTT CCTTTGTGGT GTAAATAGCA 480 

AAGCAAGCAA GAGTTCTATT ACTAAACACA GCATGACTCA AAAAACTTAG CAATTCTGAA 540 

GGAAAGTCCT TGGGGTCTTC TACCTTTCTC TTCTTTTTTG GAGGAGTAGA ATGTTGAGAG 600 

TCAGCAGTAG CCTCATCATC ACTAGATGGC ATTTCTTCTG AGCAAAACAG GTTTTCCTCA 660 

TTAAAGGCAT TCCACCACTG CTCCCATTCA TCAGTTCCAT AGGTTGGAAT CTAAAATACA 720 

CAAACAATTA GAATCAGTAG TTTAACACAT TATACACTTA AAAATTTTAT ATTTACCTTA 780 

GAGCTTTAAA TCTCTGTAGG TAGTTT6TCC AATTATGTCA CACCACAGAA GTAAGGTTCC 840 

TTCACAAAGA TCCGGGGCCC ACTCATAAAT CCAGTTGCCG CCACGGTAGC CAATCAOCGT 900 

ATCGTATAAA TCATCGTCGG TACGTTCGGC ATCGCTCATC ACAATACGTG CCTGGACGTC 960 
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GAGGATTTCG 


CGTGGGTCAA 


TGCCGCGCCA 


GATCCACATC 


AGACGGTTAA 


TCATGCGATA 


1020 


CCAGTGAGGG 


ATGGTTTTAC 


CATCAAGGGC 


CGACTGCACA 


GGCGGTTGTG 


CGCCGTGATT 


1080 


AAAGCGGCGG 


ACTAGCGTCG 


AGGTTTCAGG 


ATGTTTAAAG 


CGGGGTTTGA 


ACAGGGTTTC 


1140 


GCTCAGGTTT 


GCCTGTGTCA 


TGGATGCAGC 


CTCCAGAATA 


CTTACTGGAA 


ACTATTGTAA 


1200 


CCCGCCTGAA 


GTTAAAAAGA 


ACAACGCCCG 


GCAGTGCCAG 


GCGTTGAAAA 


GATTA6CGAC 


1260 


CGGAGATTGG 


CGGGACGAAT 


AC6AC6CCCA 


TATCCCACGG 


CTGTTCAATC 


CAGGTATCTT 


1320 


GCGGGATATC 


AACAACATAG 


TCATCAACCA 


GCGGACGACC 


AGCCGGTTTT 


GCGAAGATGG 


1380 


TGACAAAGTG 


CGCTTTTGGA 


TACATTTCAC 


GAATCGCAAC 


CGCAGTACCA 


CCGGTATCCA 


1440 


CCAGGTCATC 


AATAACGATG 


AAGCCTTCGC 


CATCGCCTTC 


TGCGCGTTTC 


AGCACTTTAA 


1500 


GCTCGCGCTG 


GTTGTCGTGA 


TCGTAGCTGG 


AAATACAAAC 


GGTATCGACA 


TGACGAATAC 


1560 


CCAGTTCACG 


CGCCAGTAAC 


GGACCCGGTA 


CCAGACCGCC 


ACGGCTTACX; 


GCAATAATGC 


1620 


CTTTCCATTG 


rrCAGAAGGC 


ATCAGTCGGC 


TTGCGAGTTT 


ACGTGCATGG 


ATCTGCAACA 


1680 


TGTCCCAGGT 


GACGATGTAT 


TTTTCGCTCA 


TGTGAAGTGT 


CCCAGCCTGT 


TTATCTACGG 


1740 


CTTAAAAAGT 


GTTCGAGGGG 


AAAATAGGTT 


GCGCGAGATT 


ATAGA6ATCA 


GCTTTTTGCA 


1800 


AAAGCCTAGG 


CCTCCAAAAA 


AGCCTCCTCA 


CTACTTCTGG 


AATAGCTCAG 


AGGCCGAGGC 


1860 


GGCCTCGGCC 


TCTGCATAAA 


TAAAAAAAAT 


TAGTCAGCCA 


TGGGGCGGAG 


AATGGGGCGG 


1920 


GATGGGCGGA 


GTTAGGGCGG 


AACTGGGCGG 


AGTTAGGGGC 


GGGACTATGG 


TTGCTGACTA 


1980 


ATTGAGATGC 


TGCATTAATG 


AATCGGCCAA 


CGC6CGGGGA 


GAGGCGGTTT 


GCGTATTGGG 


2040 


CGCTCTTCCG 


CTTCCTCGCT 


CACTGACTCG 


CTGCGCTCGG 


TCGTTCGGCT 


GCGGCGAGCG 


2100 


GTATCAGCTC 


ACTCAAAGGC 


GGTAATACGG 


TTATCCACAG AATCAGGGGA 


TAACGCAGGA 


2160 


AAGAACATGT 


GAGCAAAAGG 


CCAGCAAAAG 


GCCAGGAACC 


GTAAAAAG6C 


CGCCrPGCTG 


2220 


GCGTTTTTCC 


ATAGGCTCC6 


CCCCCCTGAC 


GAGCATCACA 


AAAATCGACG 


CTCAAGTCAG 


2280 


AGGTGGCGAA 


ACCCGACAGG 


ACTATAAAGA TACCAGGCGT 


TTCCCCCTGG 


AAGCTCCCTC 


2340 


GTGCGCTCTC 


CTGTTCCGAC 


CCTGCCGCTT 


ACGGGATACC 


TGTCCGCCTT 


TCTCCCTTCG 


2400 


GGAA6C6TGG 


CGCTTTCTCA 


ATGCTCACGC 


TGTAGGTATC 


TCAGTTCGGT 


GTAGGTCGTT 


2460 


CGCTCCAAGC 


TGGGCTGTGT 


GCACGAACCC 


CCCGTTCAGC 


CCGACCGCTG 


CGCCTTATCC 


2520 


GGTAACTATC 


GTCTTGAGTC 


CAACCCGGTA 


AGACACGACT 


TATCGCCACT 


GGGAGCAGCC 


2580 


ACTGGTAACA 


GGATTA6CA6 


AGCGAGGTAT 


GTAGGGGGTG 


CTACAGAGTT 


CTTGAAGTGG 


2640 


TGGCCTAACT 


ACGGCTACAC 


TAGAAGGACA 


GTATTTGGTA 


TCTGCGCTCT 


GCTGAAGCCA 


2700 


GTTACCTTCG 


GAAAAAGAGT 


TGGTAGCTCT 


TGATCCGGCA 


AACAAACCAC 


CGCTGGTAGC 


2760 
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GGTCGTITTT TTGTTTGCAA 6CAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT 2820 

CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT 2880 

10 

TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT 2940 

TTTAAATCAA TCTAAAGTAT ATAT6AGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC 3000 

AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC 3060 

IS GTCGTGTAGA TAACTACGAT ACGG6AGGGC TTACCATCTG GCCCCAGTCC TGCAATGATA 3120 

CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC A6CCGGAAGG 3180 

GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC 3240 

_ CXK^GAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT 3300 
20 

ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA 3360 

CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT 3420 

CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA 3480 

CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC 3540 

TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCnrrG CCCGGCGTCA 3600 

ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT 3660 

30 TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC 3720 

ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA 3780 

AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTTGAATA 3840 

^ CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC 3900 

GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC 3960 

CGAAAAGTGC CACCTGACGT CTAAGAAACC ATTATTATCA TGACATTAAC CTATAAAAAT 4020 

AGGCGTATCA CGAGGCCCTT TCGTCTCGCG CGTTTCGGTG ATGACGGTGA AAACCTCTGA 4080 

40 

CACATGCAGC TCCCGGAGAC GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAGCAGACAA 4140 

GCCCGTCAGG GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTATGCGGCA 4200 

TCAGAGCAGA TTGTACTGAG AGTGCACCAT AGCGGCGGCA TATGCGGTGT GAAATACCGC 4260 

45 ACAGATGCGT AAGGAGAAAA TACCGCATCA GGCGCCATTC GCCATTCAGG CTCCGCAACT 4320 

GTTGGGAAGG GCGATCGGTG CGGGCCTCTT CGCTATTACG CCAGAATTCG GCCCAGGGGA 4380 

CTGTGAGGAC AGAAGGCTTG TGGGTTTGAG GGAGGACTGT CTTGCAGAGG ATGATAGGGT 4440 

AAAATAGAAT GAAGGATGAT TTTTATAAAT GGTTAC!GTGC CTTAGGATGA CTACATATTT 4500 

50 

AGTCCCTTAT AAGAGAAATT GAGTAGTTGG TAAAACAACA GATAATAATT ATTAAATGAG 4560 

55 
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GAAAGAGAGA 


AACCACAGGT 


GCAAAGATTC 


ACTTTATTTA 


TTCATTCTCC 


TCCAACATTA 


4620 


10 


GCATAATTAA 


AGCCAAGGAG 


GAGGAGGGGG 


GTGAGGTGAA 


AGATGAGCTG 


GAGGACCGCA 


4680 


ATAGGGGTAG 


GTCCCCTGTG 


GAAAAAGGGT 


CAGAGGCCAA 


AGGATGGGAG 


GGGGTCAGGC 


4740 




TGGAACTGAG 


GAGCAGGTGG 


GGGCACTTCT 


CCCTCTAACA 


CTCTCCCCTG 


TTGAAGCTCT 


4800 




TTGTGACGGG 


CGAGCTCAGG 


CCCTGATGGG 


TGACTTCGCA 


GGCGTAGACT 


TTGTGTTTCT 


4860 


15 


CGTAGTCTGC 


TTTGCTCAGC 


GTCAGGGTGC 


TGCTGAGGCT 


GTAGGTGCTG 


TCCTTGCTCT 


4920 




CCTGCTCTGT 


GACACTCTCC 


TGGGAGTTAC 


CCGATTGGAG 


GGCGTTATCC 


ACCTTCCACT 


4980 




GTACTTTGGC 


CTCTCTGGGA 


TAGAAGTTAT 


TCAGCAGGCA 


CACAACAGA6 


GCAGTTCCAG 


5040 


20 


ATTTCAACTG 


CTCATCAGAT 


6GCGGGAAGA 


TGAAGACAGA 


TGGTGCAGCC 


ACAGTTCCTG 


5100 




AGGAAAGAA6 


CAAACAGGAT 


GGTGTTTAAG 


TAACAAAGTT 


CTGCCCTTGG 


GTGTGTTGTT 


5160 




TGCGGATAAG 


GGCATGTTAG 


GGACAGACAG 


AAAACAGCAT 


GCTTATCCCA 


GATAATTATA 


5220 


25 


GCAAGGAGAC 


CAAGAAGCGT 


ATTTAAAATC 


TTGATGTTTT 


GAGTTTCTTC 


CTAGCTTCCC 


5280 


CCTATTCCTT 


AATAAAGTTC 


TAAATTGTTT 


TGTTGGAGCT 


CTTTGCAGCC 


ATTCTGAGGG 


5340 




CTTTGCATGC 


TTTTCTGACC 


TTGCAGTAAA 


CTCAATGCTT 


TAGGCAAAGA 


ATGGCCACGT 


5400 




CATCCGACCC 


CCTCAGAGTT 


TAGAATTCAT 


CGATATCTAG 


ATCCTAGATA 


ATTGCATTCA 


5460 


30 


TTTAAAAAAA 


AAATATTTCT 


CCTAAAATGA 


ATACTCAGAA 


AGTGGTCTTG 


AAAAAGATTT 


5520 




GTGAAGCCGT 


TTTGACCAGA 


ATGTCAAAGT 


CTTAATAGTA 


AGGCAAAACA 


AACAACTAAA 


5580 




AAAGATCATG 


AACAAAGTCA 


CTGTAAAGAC 


TTCGGGTATT 


GGAAAATAAT 


TGAATGGAGA 


5640 


35 


CCAATAATCA 


GAGGGAAGAA 


TAATAGAGTA 


ATTTTAAGAA 


GTTTTCTAAA 


TATATTAGAA 


5700 




ATTAAA6ACA 


CTAAAGTCCT 


TCAATTTCTT 


ACATAACCTA 


ATTTTGAAAA 


TGAATTCTAA 


5760 




ATACATTTTA 


GAAGTCGATA 


AACTTAAGTT 


TGGGGAAACT 


AGAACTACTC 


AAGCTAAAAT 


5820 


40 


TAAAAGGTT6 


AACTCAATAA 


GTTAAAAGAG 


GACCTCTCCA 


GTTTCGGCTG 


AATCCTCAAC 


5880 


TTATTTTAGA 


AATGCAAATT 


ACCCAGGTGG 


TGTTTTGCTC 


AGCCTGGACT 


TTCGGTTTGG 


5940 




TGGGGCTGGA 


CAGAGTGTTT 


CAAAACCACT 


TCTTCAAACC 


ACAGCTACAA 


GTTTACCTAG 


6000 




TGGTTTTATT 


TTCCCTTCCC 


CAAATAGCCT 


TGCCACATGA 


CCTGCTTCCT 


GCCAGCTGCT 


6060 


45 


GCAGGTGTTC 


TGGTTCTGAT 


CGGCCATCTT 


GACTCAACTC 


AACATTGCTC 


AATTCATTTA 


6120 




AAAATATTTT 


AAACTTAATT 


TATTATTGIT 


AAAAGTCAGT 


TCTGGATAGG 


GTATGAGA6A 


6180 




GCCTCACTCC 


CATTCCTCGG 


TTAAACTTTA 


AGTAAOXSTCA 


GTTCTACACA 


AACAA6ACCT 


6240 


50 


CAAATTGATT 


GACAAAAATT 


TTGGACATTT 


AAAAAAATGA 


GTACTTGAAA 


ACCCTCTCAC 


6300 




ATTTTAAAGT 


CACAGTATTT 


AACTATTTTT 


CCTAGGAACC 


AACTTAAGAG 


TAAAAGCAAC 


6360 
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ATCTTCTAAT 


ATTCCATACA 


CATACTTCTG 


TGTTCCTTTG 


AAAGCTGGAC 


TTTTGCAGGC 


6420 


10 


TCCACCAGAC 


CTCTCTAGGA 


TCTCGAGCTC 


GCGAAAGCTT 


GCATGCCTGC 


AGGTCGACTC 


6480 


TAGAGGATCA 


AACCAACTGT 


CTTTGAGTAG 


AGCCAAAATT 


GTTGATATAC 


TTTGAATTIT 


6540 




AATTATATTT 


CTTGCTGAGC 


A6AGGTGGCA 


AGAGTTTTCA 


CTAATGTGCA 


AAACCACCTC 


6600 




ATGTTCCCCT 


CACCTGGGAG 


CCAGAGTAGC 


A6GAG6AAGA 


6AAGCTGAGC 


TGGGGCTTCC 


6660 


15 


ATGGTTCCCT 


CTGGGTCCTA 


ACTGAGCAGT 


TCCTCCCCAG 


GGCTCTGACA 


CAGGCATTGA 


6720 




TATGGGCTCT 


GGAA6GTAGG 


GCAGCTGGGA 


GG6ACATGCA 


AAGCAGCTGG 


GTGGGAGCTG 


6780 




AGCTTCCAGC 


TGCA6AGACC 


ACCTGCTTCT 


TCCTCTCTGC 


ACTGAGCATC 


CTGCGCCACC 


6840 


20 


CTGGTTGTCA 


GGCCAGAAAA 


GTCTGTTGGC 


TCAGTCTGAG 


TGTAGAACTT 


CTCCCTTGTG 


6900 




CTCAGAGAAT 


TTCATTCCTA 


TGTCTTTCTT 


CTCCTCAATC 


ACCTAAATTC 


ACCCAGATGA 


6960 




TGTTTGGCAC 


AAGCCTGTTA 


AGAACAATAT 


AAAAGGCTGT 


GTTTTCATTT 


CTCTCTTCCT 


7020 




ATCCTCAATA 


TGCCCAGTCA 


TCTCCCTAAG 


TGCATTATT 






7059 


25 


(2) INFORMATION FOR SEQ ID NO: 11: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) HOLECOLE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

35 

AAAAAAGAAT TCGAGCTCTT CTGATAACGC TGTCCTTCTG TTTGCAGGTG TCCAGTGTCA 
GGTCCAACTT GTCCAGTCCG G * 
(2) INFORMATION FOR SEQ ID NO: 12: 

40 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) IK)LECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
^ TTAACTTCGG CACCGGACTG GACAAGTTGG ACCTGACACT GGACACCTGC AAACAGAA6G 60 

ACAGCGTTAT CAGAAGAGCT CGAATTCTTT TTT 93 



55 



31 



EP0 578 515 A2 



10 



(2) INFORMATION FOR SBQ ID HO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
TGCCGAAGTT AAGAAGCCTG GCGCTTCTGT GAAGGTCTCC TGCAAGGCTT CTGGCTACAC 60 
CTTCACCGAC TACTGGATGA ACTGGGTTCG 90 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

30 (Jti) SEOTENCE DESCRIPTION: SEQ ID NO: 14: 

CCAGGTGCCT GTCGAACCCA GTTCATCCAG TAGTCGGTGA AGGTGTAGCC AGAAGCCTTG 60 
CAGGAGACCT TCACAGAAGC GCCAGGCTTC 90 
(2) INFORMATION FOR SBQ ID N0:15: 



15 



20 



25 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

45 ACAGGCACCT GGACAGGGCC TAGAGTGGAT GGGAAGGATT GATCCTTCCG ATAGTGAAAC 60 

TCACTACAAT CAGAAGTTCC AGGGTAGGGT 90 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTBRISTICS : 
SO (A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDMKSS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA (qenoaic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGGTCATTG TTACCCTACC CTGGAACTTC TGATTGTAGT GAGTTTCACT ATCGGAAGGA 60 
,5 TCAATCCTTC CCATCCACTC TAGGCCCTGT 90 
(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEWffESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID HO: 17: 

AACAATGACC CGAGACACAT CCACCAGCAC AGTCTACATG GAACTCAGCA GCCTGCGATC 60 
TGAGGACACC GCAGTCTATT ACTGTGCACG 90 

30 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
^ AGCCGTCCAC CTCGTGCACA GTAATAGACT GCGGTGTCCT CAGATCGCAG GCTGCTCAGT 60 
TCCATGTAGA CTGTGCTGGT GGATGTGTCT 90 
(2) INFORMATION FOR SEQ ID NO: 19: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGGTGGACGG CTCGGTTCCT TTGCTATGGA CTACTGGGGT CAAGGCACCC TCGTCACCGT 60 

CTCCTCAGCT GAGTCCOXIAC ACTCGAGGTC GAC 93 

(2) INFORMATION FOR SEQ ID NO: 20: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 87 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(xi) SEQUiENCE DESCRIPTION: SEQ ID NO: 20: 
AAAAAAGTCG ACCTCGAGTG TGAGGACTCA CCTGAGGAGA CGGT6ACGAG GGTGCCTTGA 60 
CCCCAGTAGT CCATAGCAAA GGAACCG 87 
(2) INFORMATION FOR SEQ ID NO: 21: 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 21: 
TTTTTTGAAT TCAAGCTTTC CTGACTACAT GAGTGCATirr CTGTTTTATT TCCAATTTCA 60 
GATACCACCG GAGAAATTGT GCTAACACAA 90 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
^ AATTTCTCCG GTGGTATCTG AAATTGGAAA TAAAACAGAA ATGCACTCAT GTAGTCAGGA 60 

AAGCTTGAAT TCAAAAAA 78 
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(2) INFORMATION PGR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCTCCAGCTA CATTGTCTTT GTCTCCAGGT GAGAGAGCCA CTCTATCCTG CAGAGCCAGT 
GAAAGTGTT6 ATAGTTATGG CAATAGT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ACTATCAACA CTTTCACTGG CTCTGCAGGA TAGAGTGGCT CTCTCACCTG GAGACAAAGA 
CAATGTAGCT GGAGATTGTG TTAGCAC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TTTATGCACT GGTACCAGCA GAAACCAGGA CAGGCACCAA GGCTCCTCAT CTATCGTGCA 
TCCAACCTAG AAACTGGTAT CCCTGCC 
(2) INFORMATION FOR SEQ ID N0:26: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AGTTTCTAGG TTGGATGCAC GATAGATGAG GAGCCTTGGT GCCTGTCCTG GTTTCTGCTG 60 
15 GTACCAGTGC ATAAAACTAT TGCCATA 87 

(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 
^ (C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AGGTTCAGTG GCAGTGGTTC TAGGACAGAC TTCACTCTCA CCTATTCTTC TCTAGAGCCT 60 
GAAGATTTTG CAGTGTATTA CTGTCAG 87 

30 (2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQITENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

35 

(li) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

40 

CACTGCAAAA TCTTCAGGCT CTAGAGAAGA ATAGGTGAGA GTGAAGTCTG TCCTAGAACC 60 
ACTGCCACTG AACCTGGCAG GGATACC 87 
(2) INFORMATION FOR SEQ ID NO: 29: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



50 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CAAAGTAATG AGGATCCTCG GACGTTCGGT GGAGGCACCA AGGTGGAAGA GAAACGTAAG 60 

TGCACTTTCC TCGAGGTCGA CmnT l ' 87 

(2) INFORMATION FOR SEQ ID NO:30: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 99 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAAAAAGTCG ACCTCGAGGA AAGTGCACTT ACGTTTCTCT TCCACCTTGG TGCCTCCACC 60 
GAACGTCCGA GGATCCTCAT TACTTTGCTG ACAGTAATA 99 
(2) INFORMATION FOR SEQ ID NO:31: 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 31: 
AAAAAAGAAT TCGAGCTCTT TTCTGATAAC GTTGTCCTTC TGTTTCTTGC AGGTGTCCAG 60 
TGTCAGGTCC AACTTCAGCA GCCT6GG 87 
(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI-SENSE: YES 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AAAAAAGTCG ACTGTGAGGA CTCACCTGAG GAGACGGTGA CTGAGGTGCC T 51 
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(2) INFORMATION FOR SEQ ID MO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
AAAAAAGAAT TCAAGCTTTC CTGACTACAT GAGTGCATTT CTGTTTTATT TCCAATTTCA 
20 GATACCACCG GAGACATTGT GCTAACACAA TCTCCA 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AAAAAAGTCG ACCTCGAGAT CACTTACGTT TGATTTCCAG CTTGGTGCCT CCAC 

35 



40 



45 



50 
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Claims 

5 I.Am thod for producing a humanized monod nal antibody by utilizing a process of comparativ model 
building comprising: 

a) selecting a monoclonal antibody to be humanized; 

b) searching computer databanks for protein crystal structures that demonstrate greater than 50 per- 
cent sequence homology to the variable region of said antibody to produce a structural template; 

10 c) determining the structure of the complementarity determining region, or CDR. loops and assigning 

the loops to canonical loop conformations; 

d) determining the framework reskJues which are crucial to the conformation of the CDR bops; 

e) replacing the CDR loops of the structural templates with canonical CDR backbone templates using 
interactive computer graphics; 

IS f) searching computer databanks to extract Initial backbone approximations for each loop for norhcan- 

onical CDR loops; 

g) replacing all non-conserved amino-acid side chains in similar positfons on sakJ antibody and on the 
computer model with human amino acki reskJues using interactive computer graphics to produce a mod- 
el having a combination of backt)one fragmente of different antibodies with replaced skJe chains; 
20 h) sdvating the OKXIels with a water layer corresponding to about 7 angstronrts; 

i) refining the structure with an energy minimization protocol to produce a structure wherein all atoms 
of the system are freely mobile; 

j) searching computer databanks to find honridogous human sequences for the variable light and va- 
riable heavy chains of the antibody; 
25 k) combining the sequences found In 0) to obtein human templates; 

I) comparing the structural template of (a) with the hunuin templates of (k) and selecting a human template 
with variable regk>ns having greater than 50 percent sequence klentity with the structural template; 
m) determining the CDR foops of the human template selected in (I); 

n) replacing the CDR loop region of the selected human template with the analogous sequences from 
30 the antibody to produce a Phase 1 humanized sequence; 

o) superimposing the models of the antibody and the Phase 1 humanized sequence to compare the 
binding site regions; 

p) identifying by tlw comparison in (o) an amino ackte bit he framework residues and 
interact wfththe antSxxiy CDR loops that can be important to thestructixal integrity of the antibody bMing ste; 
35 q) reinserting into the Phase 1 humanized sequence all amino acki residues identified in (p) to be dif- 

ferent from those In the antit>ody, and refining the resultent structure with an energy minimization pro- 
tocol to produce a Phase II humanized sequence; 

r) refining the Phase II humanized sequence using iterative confornuitional search protocols on all re- 
gions of the binding site and by analyste of the binding site to determine which regions of the CDR sur- 
40 face or residues at the CDR -framework junction are not likely to involve ant^en binding; and 

s) replacing the amino adds in the non-antigen binding regk)ns of the binding site with amino add re- 
sidues corresponding to the human residues to produce a humanized monodonal antibody. 

2. The method of Claim 1 wherein the nK)nodonal antibody is a murine antibody. 

45 

3. The method of Claim 2 wherein the nranodonal antibody is an anti-CD18 nrK>nodonal antibody. 

4. The method of Claim 3 wherein the monodonal antibody is 60.3 

5. A hunr^nized nrK>nodonal antibody having the structoral and binding characteristics of the anti- CD18 
^ monodonal antibody 60.3 

6. The humanized nrK)nodonal antibody of Claim 5 wherein the amino add at position 50 in Figure 2 is 
changed from Arg to Asp. 

55 7. Th humanized m nod nal antibody of Claim 6, wherein th amino add at positi n 54 is changed from 
Leu to Aig and th amino add at position 55 is changed from C^u to Ala. 

8. The humanized monodonal antibody of Claim 7 wherein the amino at posffion 68 is changed from Aig to Gly. 
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60^ Heavy Chain Sequences 



KflbiA hvw/ 2i4t heoj heoj heu moj 

• Jh4 1CL iMiplate PhMlV PffiMl/m mmml mM^ 




FIGURE 1 1/3 
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60^ Heavy Chain Sequences 



Kebat hVM/ 21^ hSOJ HBOS h60J liBOJ 
• JM "CL tamplsto RimIV ItiMBmi PhMl mSOS 




FIGURE 1 2/3 
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60.3 Heavy Chain Sequences 



Ktbat 



hVhl/ 21*3 hBU heOJ ti60J 
JM. XL iMiplM PhtMlV PfmmlUm 



HBOJ 



1b6(L3 



HV3 



FR4 




FR4 



* Th» h60 J htavy chain wu mad* with tha aaqu*noa ahown in lha 
Phaaa n/lll oolumn, wfth tha foOowmo axoapftbn: 
H71: Ajg instaadof Val 



FIGURE 1 3/3 
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60^ Uotn Chun SequenoM 




FIGURE 2 1/3 
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eo.3 Ughl Chain Sequences 



KBbst hvun/ hBoa fasoj mls heu 
« Jk tonvlatB PhnslV Phn»IMa Phml ««L3 




FIGURE 2 2/3 
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60.3 Light Chain Sequences 



KabBt hVknv h60J hBOia MOS hSOS 
# Jk tamplato PhMlV PhMlVn PhM»l n6(t3 




* The h60.3 light chain was made wHh the sequence shofwn In the 
Phase tI/llloolumfv«HW)th»folo«vkv«>o*P<)ons: 
L68: Axg MMdot Oy 
L7$: Tjr ImiMdof to 
L104: *Val InstMdof Lau 
L106: Glu inslMdof to 



FIGURE 2 3/3 
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nauRE3 

Oligonucleotide #1: 

TTriTroAATrCAAGCTTrCCroACTAQ\TGAGTGCATI^^ 
TTCAGATACCACCGGAGAAATrGTGCTAACACAA 

Oligonudeotide #2 

AATTTCrcCGGIXKjTATCKjAAATrcGAAATAAAACAGAAATGCAC^ 
TCAGGAAAGCTTGAATrCAAAAAA 

Oligonucleotide ffi 

TCTCCAGCTACATnn-ClTIXnxnCCAGGTGAGAGAGCCACTCTATCC^^ 
CCAGTGAAAGTGTTGATAGTTATGGCAATAGT 

Oligonucleotide #4 

ACrATCAACA(nTrcACTGG<nCTX3CAGGATAGAGTGGCTCTXnt^ 
AAAGACAATGTAGCrGGAGATTGTGTTAGCAC 

Oligonucleotide #S 

TrrATGCACTGGTACCAGCAGAAACCAGGACAGGCACCAAGGCrcCrcATCTAT 
CGTGCATCCAACCTAGAAACTGGTATCCCTGCC 

Oligonudeotide #6 

AGTTTCTAGGTTGGATGCACGATAGATGAGGAGCCTTQGTQCCnnXXnXK?^^ 
TGCTGGTACCAGTGCATAAAACTATTGCCATA 

Oligonucleotide #7 

AGGTrCAGTGGCAGTGGTTCTAGGACAGACTTCACItnXIACCTATI^^ 
AGCCTGAAGATnTGCAGTGTATTACTGTCAG 
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Oligonudeotide ifZ 

CACTGCAAAATOTCAGGCICrAGAGAAGAATAGGTGAGAGTGAAGTCItjTCCr 
AGAACCACTGCCACTGAACCrGGCAGGGATACC 

Oligonudeotide S9 

CAAAGTAATOAGQATCCltXOACGTTCGGTGGAGGCACCAAGGTGGAAGAGAA 
ACGTAAGTGCACTrrcCrCGAGGTCGACrTTTrr 

Oligonudeotide #10 

AAAAAAGTCGACCTCGAGGAAAGTGCACrrACGTrrcrCTrcCACCTr^^ 
CCACCGAACGTCaSAGGATCCTCATTACTITGCTGACAGTAATA 
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Oligtmucleodde i 1 

AAAAAAGAATIXXJAGCIOTCTGATAACGCTOTCCTTCTt^^ 
TGTCAGGrcCAACrrGTCCAGTCCGG 



Oligonucleotide #2 

TrAACTTCGGCACCGGACTGGACAAGlTGGACCroACACIXK}ACACCT^ 
AGAAGGACAGCGTTATCAGAAGAGCTCGAATrCTrmT 



Oligramdeotide 

TGCCXSAAGTTAAGAAQCCTXSGCGCrrCTGTGAAGGTCTCCTGCAAGGCTT^ 
CTACACCrrCACCGACTACrGGATGAACrGGGTTCG 



Oligonucleotide #4 

CCAGGTGCCnntTGAACCCAGTrCATCCAGTAGTCGGTGAAGGTGTAGCCAGAA 
GCCTrGCAGGAGACCTTCACAGAAGCGCCAGGCrrC 



Oligonucleotide #S 

ACAGGCACCTXKSACAGGGCCTAGAGTGGATGGGAAGGATTGATCCrrCCGATAG 
TGAAACTCACTACCATCAGAAGTrCCAGGGTAGGGT 



Oligonucleotide #6 

CGGGTCATrGTTACCCrACCCTGGAACTTCTGATTGTAGTGAGTTrcACT 
AAGGATCAATCCTTCCCT^TtXJACTCTAGGCCCTGT 

Oligonucleotide i97 

AACAATGACCCGAGACACATCCACCAGCACAGTCTACATGGAACrCAGCAGCCT 
GCGATCTQAGGACACCGCAGTCTATTACTGTGCACX} 
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Oligonucleotide #8 

agccgtccacctcgtgcacagtaataoacix3cggcgtcctx:agatc^^ 
croagttccatgtagactgtgctggtggatgtgtct 

CMigonudeotide #9 

AGGTGGACGGCTCGGTKXnTKXn'ATGGACrACTtKKKnCAAGG^ 
ACCGTCnx:CTCAGGTGATGCCrcACACTajAGGTCGACiril'lT 

Oligonudeotide #10 

AAAAAAGTCGACCTCGAGTGTGAGGACTCACCroAGGAGACGGTGACGAGGGT 
GCCTTGACCCCAGTAGTCCATAGCAAAGGAACCG 
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FACS Analysis 
Binding of FITC-m60.3 to HL60 ceiis 
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pGk.11 



1701 GGCC CAGGGGACTG TGAGGACAGA AGGCTTGTGG GTTTGAGGGA 

1751 GGACTGTCTT GCAGAGGATG ATAGGGTAAA ATAGAATGAA GGATGATTTT 
1801 TATAAATGGT TACGTGCCTT AGGATGACTA CATATTTAGT CCCTTATAAG 
1851 AGAAATTGAG TAGTTGGTAA AACAACAGAT AATAATTATT AAATGAGGAA 
1901 AGAGAGAAAC CACAGGTGCA AAGATTCACT TTATTTATTC ATTCTCCTCC 
1951 AACATTAGCA TAATTAAAGC CAAGGAGGAG GAGGGGGGTG AGGTGAAAGA 
2001 TGAGCTGGAG GACCGCAATA CGGGTAGGTC CCCTGTGGAA AAAGGGTCAG 
2051 AGGCCAAAGG ATGGGAGGG6 GTCAGGCTGG AACTGAGGAG CAGGTGGGGG 
2101 CACTTCTCCC TCTAACACTC TCCCCTGTTG AAGCTCTTTG TGACGGGCGA 
2151 GCTCAGGCCC TGATGGGTGA CTTCGCAGGC 6TAGACTTTG TGTTTCTCGT 
2201 AGTCTGCTTT GCTCAGCGTC AGGGTGCTGC TGAG6CTGTA GGTGCTGTCC 
2251 TTGCTCTCCT GCTCTGTGAC ACTCTCCTGG GAGTTACCCG ATTGGAGGGC 
2301 GTTATCCACC TTCCACTGTA CTTTGGCCTC TCTGGGATAG AAGTTATTCA 
2351 GCAGGCACAC AACAGAGGCA GTTCCAGATT TCAACTGCTC ATCAGATGGC 
2401 GGGAAGATGA AGACAGATGG TGCAGCCACA GTTCCTGAGG AAAGAAGCAA 
2451 ACAGGATGGT GTTTAAGTAA CAAAGTTCTG CCCTTGGGTG TGTTGTTTGC 
2501 GGATAAGGGC ATGTTAGGGA CAGACAGAAA ACAGCATGCT TATCCCAGAT 
2551 AATTATAGCA AGGAGACCAA 6AAGCGTATT TAAAATCTTG ATGTTTTGAG 
2601 TTTCTTCCTA GCTTCCCCCT ATTCCTTAAT AAAGTTCTAA ATTGTTTTGT 
2651 TGGAGCTCTT TGCAGCCATT CTGAGGGCTT TGCATGCTTT TCTGACCTTG 
2701 CAGTAAACTC AATGCTTTAG GCAAAGAATG GCCACGTCAT CCGACCCCCT 
2751 CAGAGTTTAG AATTCATCGA TATCTAGATC CTA6ATAATT GCATTCATTT 
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2801 AAAAAAAAAA TATTTCTCCT AAAATGAATA CTCAGAAAGT GGTCTTGAAA 
2851 AAGATTTGTG AAGCCGTTTT GACCAGAATG TCAAAGTCTT AATAGTAAGG 
2901 CAAAACAAAC AACTAAAAAA GATCATGAAC AAAGTCACTG TAAAGACTTC 
2951 GGGTATT6GA AAATAATTGA ATGGAGACCA ATAATCAGAG GGAA6AATAA 
3001 TAGAGTAATT TTAAGAAGTT TTCTAAATAT XTTAGAAATT AAAGACACTA 
3051 AAGTCCTTCA ATTTCTTACA TAACCTAATT TTGAAAATJSA ATTCTAAATA 
3101 CATTTTAGAA GTCGATAAAC TTAA6TTTGG G6AAACTAGA ACTACTCAAG 
3151 CTAAAATTAA AAGGTTGAAC TCAATAA6TT AAAAGAGGAC CTCTCCAGTT 
3201 TCGGCTGAAT CCTCAACTTA TTTTAGAAAT GCAAATTACC CAG6TGGTGT 
3251 TTTGCTCAGC CTGGACTTTC GGTTTG6TGG GGCTGGACAG AGTC TTTCAA 
3301 AACCACTTCT TCAAACCACA GCTACAAGTT TACCTAGTGG TTTTATTTTC 
3351 CCTTCCCCAA ATAGCCTTGC CACATGACCT GCTTCCTGCC AGCTGCTGCA 
3401 GGTGTTCTGG TTCTGATCGG CCATCTTGAC TCAACTCAAC ATTGCTCAAT 
3451 TCATTTAAAA ATATTTTAAA CTTAATTTAT TATT6TTAAA AGTCAGTTCT 
3501 GGATAGGGTA TCAGAGAGCC TCACTCCCAT TCCTCGGTTA AACTT TAAG T 
3551 AATGTCAGTT CTACACAAAC AAGACCTCAA ATTGATTGAC AAAAATTTTG 
3601 GACATTTAAA AAAATGAGTA CTTGAAAACC CTCTCACATT TTAAAGTCAC 
3651 AGTATTTAAC TATTTTTCCT AGGAACCAAC TTAAGAGTAA AAGCAACATC 
3701 TTCTAATATT CCATACACAT ACTTCTGTGT TCCTTTGAAA GCTGGACTTT 
3751 TGCAGGCTCC ACCAGACCTC TCTAGGATCT CGAGCTCGCG AAAGCTTGCA 
3801 TGCCTGCAGG TCGACTCTAG AGGATCAAAC CAACTGTCTT T6AGTA6AGC 
3851 CAAAATTGTT GATATACTTT 6AATTTTAAT TATATTTCTT GCTGAGCAGA 
3901 GGTGGCAAGA GTTTTCACTA ATGTGCAAAA CCACCTCATG TTCCCCTCAC 
3951 CTGGGAGCCA GAGTAGCAGG AGGAAGAGAA GCTGAGCTGG GGCTTCCATG 
4001 GTTCCCTCTG GGTCCTAACT GAGCAGTTCC TCCCCAGGGC TCTGACACAG 
4051 GCATTGATAT GGGCTCTGGA AGGTAGGGCA GCTGGGAGGG ACATGCAAAG 
4101 CAGCTGGGT6 GGAGCT6AGC TTCCAGCTGC AGAGACCACC TGCTTCTTCC 
4151 TCTCTGCACT GAGCATCCTG CGCCACCCTG GTTGTCAGGC CAGAAAAGTC 
4201 TGTTGGCTCA GTCTGA6TGT AGAACTTCTC CCTTGTGCTC AGAGAATTTC 
4251 ATTCCTATGT CTTTCTTCTC CTCAATCACC TAAATTCACC CAGATGATGT 
4301 TTGGCACAAG CCTGTTAAGA ACAATATAAA AGGCT6TGTT TTCATTTCTC 
4351 TCTTCCTATC CTCAATATGC CCAGTCATCT CCCTAAGTGC ATTATTGGAT 
4401 CCAGACATGA TAAGATACAT TGATGA6TTT GGACAAACCA CAACTAGAAT 
4451 GCAGTGAAAA AAATGCTTTA TTT6TGAAAT TTGTGATGCT ATTGCTTTAT 
4501 TTGTAACCAT TATAAGCTGC AATAAACAAG TTAACAACAA CAATTGCATO 
4551 CATTTTATGT TTCAGGTTCA GGGGGAGGTG TGGGAGGTTT TTTAAAGCAA 
4 601 GTAAAACCTC TACAAATGTG 6TATGGCTGA TTATGATCTC TA6TCAAGGC 
4 651 ACTATACATC AAATATTCCT TATTAACCCC TTTACAAATT AAAAAGCTAA 
4701 AGGTACACAA TTTTTGAGCA TAGTTATTAA TAGCAGACAC TCTATGCCTG 
4751 TGTGGAGTAA GAAAAAACAG TATGTTATGA TTATAACTGT TATGCCTACT 
4801 TATAAAGGTT ACAGAATATT TTTCCATAAT TTTCTTGTAT AGCAGTGCAG 
4851 CTTTTTCCTT TGTGGTGTAA ATAGCAAAGC AAGCAAGAGT TCTATTACTA 
4901 AACACAGCAT GACTCAAAAA ACTTAGCAAT TCTGAAGGAA AGTCCTTGGG 
4 951 GTCTTCTACC TTTCTCTTCT TTTTTGGAGG AGTAGAATGT TGAGAGTCAG 
5001 CAGTAGCCTC ATCATCACTA 6ATGGCATTT CTTCTGAGCA AAACAGGTTT 
5051 TCCTCATTAA AGGCATTCCA CCACTGCTCC CATTCATCAG TTCCATAGGT 
5101 T6GAATCTAA AATACACAAA CAATTAGAAT CAGTAGTTTA ACACATTATA 
5151 CACTTAAAAA TTTTATATTT ACCTTAGAGC TTTAAATCTC TGTAGGTAGT 
5201 TTGTCCAATT ATGTCACACC ACAGAAGTAA GGTTCCTTCA CAAAGATCCG 
5251 GGGCCCACTC ATAAATCCAG TTGCCGCCAC GGTAGCCAAT CACCGTATCG 
5301 TATAAATCAT CGTCGGTACG TTCGGCATCG CTCATCACAA TACGTGCCTG 
5351 GACGTCGAGG ATTTCGCGTG G6TCAATGCC GCGCCAGATC CACATCAGAC 
5401 GGTTAATCAT GCGATACCAG TGAGGGATGG TTTTACCATC AAGGGCCGAC 
5451 TGCACAGGCG GTTGTGCGCC GTGATTAAAG CGGCGGACTA GCGTCGAG6T 
5501 TTCAGGATGT TTAAAGCGGG GTTT6AACAG GGTTTCGCTC AGGTTTGCCT 
5551 GTGTCATGGA TGCAGCCTCC AGAATACTTA CTGGAAACTA TTGTAACCCG 
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2801 AAAAAAAAAA TATTTCTCCT AAAATGAATA CTCAGAAAGT GGTCTTGAAA 
2851 AAGATTTGTG AAGCCGTTTT GACCAGAATG TCAAAGTCTT AATAGTAAGG 
2901 CAAAACAAAC AACTAAAAAA GATCATGAAC AAA6TCACTG TAAAGACTTC 
2951 GGGTATTGGA AAATAATTGA ATGGAGACCA ATAATCAGAG G6AAGAATAA 
3001 TAGAGTAATT TTAAGAAGTT TTCTAAATAT ATTAGAAATT ^GACACTA 
3051 AAGTCCTTCA ATTTCTTACA TAACCTAATT TTGAAAATgA ATTCMlAATA 
3101 CATTTTAGAA GTCGATAAAC TTAAGTTTGG GGAAACTAGA ACTACTCAAG 
3151 CTAAAATTAA AAGGTTGAAC TCAATAA6TT AAAAGAGGAC CTCTCCAGOT 
3201 TCGGCTGAAT CCTCAACTTA TTTTAGAAAT GCAAATTACC CA6 GTGGT GT 
3251 TTTGCTCAGC CTGGACTTTC GGTTTGGTGG GGCTGGACAG ACTGTTTC AA 
3301 AACCACTTCT TCAAACCACA GCTACAAGTT TACCTAGTGG TTTTATTTTC 
3351 CCTTCCCCAA ATAGCCTTGC CACATGACCT GCTTCCTGCC AGCTGCTGCA 
3401 GGTGTTCTGG TTCTGATCGG CCATCTTGAC TCAACTCAAC ATTGCTCAAT 
3451 TCATTTAAAA ATATTTTAAA CTTAATTTAT TATTGTTAAA AGTCA6TTCT 
3501 6GATAGGGTA TGAGAGAGCC TCACTCCCAT TCCTCGGTTA AACT TTAAGT 
3551 AATGTCAGTT CTACACAAAC AAGACCTCAA ATTGATTGAC AAAAATTTTG 
3601 GACATTTAAA AAAATGAGTA CTT6AAAACC CTCTCACATT TTAAAGTCAC 
3651 AGTATTTAAC TATTTTTCCT AGGAACCAAC TTAAGAGTAA AAGCAACATC 
3701 TTCTAATATT CCATACACAT ACTTCTGTGT TCCTTT6AAA GCTGGACTTT 
3751 TGCAGGCTCC ACCAGACCTC TCTAGGATCT CGAGCTCGCG AAAGCTTGCA 
3801 TGCCTGCAGG TCGACTCTAG AGGATCAAAC CAACTGTCTT TGAGTAGAGC 
3851 CAAAATTGTT GATATACTTT GAATTTTAAT TATATTTCTT GCTGAGCAGA 
3901 GGTGGCAAGA GTTTTCACTA ATGTGCAAAA CCACCTCATG TTCCCCTCAC 
3951 CTGGGAGCCA GAGTAGCAGG AGGAAGAGAA GCTGAGCTGG GGCTTCCATG 
4001 GTTCCCTCTG GGTCCTAACT GAGCAGTTCC TCCCCAGGGC TCTGACACAG 
4051 GCATTGATAT GGGCTCTGGA AGGTAGGGCA GCTGGGAGGG ACATGCAAAG 
4101 CAGCTGGGTG GGAGCTGAGC TTCCAGCTGC AGA6ACCACC TGCTTCTTCC 
4151 TCTCTGCACT GAGCATCCTG CGCCACCCTG GTTGTCAGGC CAGAAAAGTC 
4201 TGTTGGCTCA GTCTGA6TGT AGAACTTCTC CCTTGTGCTC AGAGAATTTC 
4251 ATTCCTATGT CTTTCTTCTC CTCAATCACC TAAATTCACC CAGATGATGT 
4301 TTGGCACAAG CCTGTTAAGA ACAATATAAA AGGCTGTGTT TTCATTTCTC 
4351 TCTTCCTATC CTCAATATGC CCAGTCATCT CCCTAAGTGC ATTATTGGAT 
4401 CCAGACATGA TAAGATACAT TGATGA6TTT GGACAAACCA CAACTAGAAT 
4451 GCAGTGAAAA AAATGCTTTA TTTGTGAAAT TTGTGATGCT ATTGCTTTAT 
4501 TTGTAACCAT TATAAGCTGC AATAAACAAG TTAACA ACAA CAATTGCATT 
4551 CATTTTATGT TTCAGGTTCA GGGGGAGGTG TGGGAGGTTT TTTAAAGCAA 
4601 GTAAAACCTC TACAAATGTG GTATGGCTGA TTATGATCTC TAGTCAAGGC 
4 651 ACTATACATC AAATATTCCT TATTAACCCC TTTACAAATT AAAAAGCTAA 
4701 AGGTACACAA TTTTTGAGCA TAGTTATTAA TAGCAGACAC TCTATGCCTG 
4751 TGTGGAGTAA GAAAAAACAG TATGTTATGA TTATAACTGT TATGCCTACT 
4801 TATAAAGGTT ACAGAATATT TTTCCATAAT TTTCTTGTAT AGCAGTGCAG 
4851 CTTTTTCCTT TGTGGT6TAA ATAGCAAAGC AAGCAAGAGT TCTATTACTA 
4 901 AACACAGCAT GACTCAAAAA ACTTAGCAAT TCTGAAGGAA AGTCCTTGGG 
4951 GTCTTCTACC TTTCTCTTCT TTTTTGGAGG AGTAGAATGT TGAGAGTCAG 
5001 CAGTAGCCTC ATCATCACTA 6ATGGCATTT CTTCTGAGCA AAACAGGTTT 
5051 TCCTCATTAA AGGCATTCCA CCACTGCTCC CATTC ATCA G TTCCATAGGT 
5101 TGGAATCTAA AATACACAAA CAATTA6AAT CAGTAGTTTA ACACATTATA 
5151 CACTTAAAAA TTTTATATTT ACCTTAGAGC TTTAAATCTC TGTAGGTAGT 
5201 TTGTCCAATT ATGTCACACC ACAGAAGTAA GGTTCCTTCA CAAAGATCCG 
5251 GGGCCCACTC ATAAATCCAG TTGCCGCCAC GGTAGCCAAT CACCGTATCG 
5301 TATAAATCAT CGTCGGTACG TTCGGCATCG CTCATCACAA TACGTGCCTG 
5351 GAC6TCGAGG ATTTCGCGTG GGTCAATGCC GCGCCAGATC CACATCAGAC 
5401 GGTTAATCAT GCGATACCAG TGAGGGATGG TTTTACCATC AAGGGCCGAC 
5451 TGCACAGGCG GTTGTGCGCC GTGATTAAAG CGGCGGACTA GCGTCGAGGT 
5501 TTCAGGATGT TTAAAGCG6G GTTTGAACAG GGTTTCGCTC AGGTTTGCCT 
5551 GTGTCATGGA TGCAGCCTCC AGAATACTTA CTGGAAACTA TTGTAACCCG 
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5601 CCTGAAGTTA AAAAGAACAA CGCCCGGCAG TGCCAGGCGT TGAAAAGATT 
5651 AGCGACCGGA GATTGGCGGG ACGAATACGA CGCCCATATC CCACGGCTGT 
5701 TCAATCCAGG TATCTTGCGG GATATCAACA ACATAGTCAT CAACCAGCGG 
5751 ACGACCAGCC GGTTTTGC6A AGATGGTGAC AAAGTGCGCT TTTGGATACA 
5801 TTTCACGAAT CGCAACCGCA GTACCACCGG TATCCACCAG GTCA TCAATA 
5851 ACGATGAAGC CTTCGCCATC GCCTTCTGCG CGTTTCAGCA CTTTAAGCTC 
5901 GCGCTGGTTG TCGTGATCGT AGCTGGAAAT ACAAACGGTA TCGACATGAC 
5951 GAATACCCAG TTCACGCGCC AGTAACGCAC CCGGTACCAG ACCGCCACGG 
6001 CTTACGGCAA TAATGCCTTT CCATTGTTCA CAAGGCATCA GTCGGCTTGC 
€051 GAGTTTACGT GCATGGATCT GCAACATGTC CCAGGTGACG ATGTATTTTT 
6101 CGCTCATGTG AAGTGTCCCA GCCTGTTTAT CTACGGCTTA AAAAGTGTTC 
6151 GAGGGGAAAA TAGGTTGCGC GAGATTATAG AGATCAGCTT TTTGCAAAAG 
6201 CCTAGGCCTC CAAAAAAGCC TCCTCACTAC TTCTGGAATA GCTCAGAGGC 
6251 CGAGGCGGCC TCGGCCTCTG CATAAATAAA AAAAATTAGT CAGCCATGGG 
6301 GCGGAGAATG GGGCGGGATG GGCGGAGTTA GGGCGGAACT GGGCG6AGTT 
6351 AGGGGCGGGA CTATGGTTGC T6ACTAATTG AGATGCTGCA TTAATGAATC 
6401 GGCCAACGCG CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC 
6451 CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT 
6501 CAGCTCACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC. 
6551 GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA GGAACCGTAA 
6601 AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 
6651 ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA 
6701 TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT 
6751 TCCGACCCTG CCGCTTACCG GATACCTGTC CGeCTTTCTC CCTTCGGGAA 
6801 GCGTGGCGCT TTCTCAATGC TCACGCTGrTA GGTXTCTCAG TTCGGTGTAG 
6851 GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG TTCAGCCCGA 
6901 CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 
6951 ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG 
7001 AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG 
7051 CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA 
7101 CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT 
7151 GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC GCAGAAAAAA 
7201 AGGATCTCAA GAAGATCCTT T6ATCTTTTC TACGGGGTCT 6ACGCTCAGT 
7251 GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG 
7301 ATCTTCACCT AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA 
7351 AAGTATATAT GAGTAAACTT G6TCTGACAG TTACCAATGC TTAATCAGTG 
7401 AGGCACCTAT CTCAGCGATC TGTCTATTTC GTXCATCCAT AGTTGCCTGA 
7451 CTCCCCGTCG TGTAGATAAC 7ACGA7ACGG GAGGGCTTAC CATCTGGCCC 
7501 CAGTGCTGCA ATGATACCGC 6AGACCCACG CTCACCGGCT CCAGATTTAT 
7551 CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA 
7601 ACTTTATCCG CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT 
7651 AAGTAGTTCG CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG 
7701 GCATCGTGGT 6TCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT 
7751 TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT GCAAAAAAGC 
7801 GGTTAGCTCC TTCGGTCCTC CGATCG7TGT CA6AAGTAAG TTGGCCGCAG 
7851 TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG 
7901 CCATCCGTAA OATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT 
7951 CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC 
8001 GGGATAATAC CGC6CCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA 
8051 AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATCTTACCGC TGTTGAGATC 
8101 CAGTTCGATG TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA 
8151 CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA 
8201 AAAAAGGGAA TAAGGGCGAC ACGGAAATGT TGAA7ACTCA TACTCTTCCT 
8251 TTTTCAATAT TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT 
8301 ACATATTTGA ATGTATTTAG AAAAATAAAC AAATAGGG6T TCCGCGCACA 
8351 TTTCCCCGAA AAGTGCCACC T6ACGTCTAA 6AAACCATTA TTATCATGAC 



FIGURE 10 3/4 
61 



EP 0 578 515 A2 



EcoRl MCS 




SV40or1 

FIGURE 11 173 



EP0 578 515 A2 



EcoRI 




SV40 ori 



FIGDRE 11 2/3 



63 



EP0 578 515 A2 




A90 



FIGURE 11 3/3 



EP0 578 515 A2 



pNo1.16(2) 



1 AAGCTTTCTG GGGCAGGCCA GGCCTGACCT TGGCTTTGGG GCAGGGAGGG 
51 GGCTAAGGTG AGGCAGGTGG CGCCAGCCAG GTGCACACCC AATGCCCATG 
101 AGCCCAGACA CTGGACGCTG AACCTCGCGG ACAGTTAAGA ACCCAGGGGC 
151 CTCTGCGCCC TGGGCCCAGC TCTGTCCCAC ACCGCGGTCA CATGGCACCA 
201 CCTCTCTTGC AGCCTCCACC AAGGGCCCAT CGGTCTTCCC CCTGGCACCC 
251 TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGG6T GCCTGGTCAA 
301 GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA GGCGCCCTGA 
351 CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
401 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC 
451 CTACATCTGC AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAAC 
501 GCGTTGGTGA GAGGCCAGCA CAGGGAGGGA GGGTGTCTGC TGGAAGCCAG 
551 GCTCAGCGCT CCTGCCTGGA CGCATCCC6G CTATGCAGCC CCAGTCCAGG 
601 GCAGCAAG6C AGGCCCCGTC TGCCTCTTCA CCCGGAGGCC TCTGCCCGCC 
651 CCACTCATGC TCAGGGAGAG GGTCTTCTGG CTTTTTCCCC AGGCTCTGGG 
701 CAGGCACAGG CTAGGTGCCC CTAACCCAGG CCCTGCACAC AAAGGGGCAG 
751 6TGCTGGGCT CAGACCTGCC AAGAGCCATA TCCGGGAGGA CCCTGCCCCT 
eOl GACCTAAGCC CACCCCAAAG GCCAAACTCT CCACTCCCTC AGCTCGGACA 
851 CCTTCTCTCC TCCCA6ATTC CAGTAACTCC CAATCTTCTC TCTGCAGAGC 
901 CCAAATCTTG TGACAAAACT CACACATGCC CACCGTGCCC AGGTAAGCCA 
951 GCCCAGGCCT CGCCCTCCAG CTCAAGGCGG 6ACAGGTGCC CTAGAGTAGC 
1001 CTGCATCCAG GGACAGGCCC CAGCCGGGTG CTGACACGTC CACCTCCATC 
1051 TCTTCCTCAG CACCTGAACT CCTGGGGGGA CCGTCAGTCT TCCTCTTCCC 
1101 CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCGT GAGGTCACAT 
1151 GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 
1201 TACGTGGACG GCGTGGA6GT GCATAATGCC AAGACAAAGC CGCGGGAGGA 
1251 GCAGTACAAC AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC 
1301 AGGACTGGCT GAATGGCAAG GAGTACAAGT GCAAGGTCTC CAACAAAGCC 
1351 CTCCCAGCCC CCATCGAGAA AACCATCTCC AAAGCCAAAG GTGGGACCCG 
1401 TGGGGTGCGA GGGCCACATG GACAGAGGCC GGCTCGGCCC ACCCTCTGCC 
1451 CTGAGAGTGA CCGCTGTACC AACCTCTGTC CCTACAGGGC AGCCCCGAGA 
1501 ACCACAGGTG TACACCCTGC CCCCATCTAG AGAGGAGATG ACCAAGAACC 
1551 AGGTCAGCCT GACCTGCCTG GTCAAAGGCT TCTATCCCAG CGACATCGCC 
1601 GTGGAGTGGG AGAGCAATGG GCAGCCGGAG AACAACTACA AGACCACGCC 
1651 TCCCGTGCTG GACTCCGACG GCTCCTTCTT CCTCTACAGC AAGCTCACCG 
1701 TGGACAAGAG CAGGTGGCAG CAGGGGAACG TCTTCTCATG CTCCGT6ATG 
1751 CATGAGGCTC TGCACAACCA CTACACGCAG AAGAGCCTCT CCCTGTCTCC 
1801 GGGTAAATGA GTGCGACGGC CGGCAAGCCC CCGCTCCCCG GGCTCTCGCG 
1851 GTCGCACGAG GATGCTTGGC ACGTACCCCC TGTACATACT TCCCGGGCGC 
1901 CCAGCATGGA AATAAAGCAC CCAGCGCTGC CCTGGGCCCC TGCGAGACTG 
1951 TGATGGTTCT TTCCACGGGT CAGGCCGAGT CTGAGGCCTG A6TGGCATGA 
2001 GGGAGGCAGA GCGGGTCCCA CTGTCCCCAC ACTGGCCCAG GCTGTGCAGG 
2051 TGTGCCTGGG CCCCCTAGGG TGGGGCTCAG CCAGGGGCTG CCCTCGGCAG 
2101 GGTGGGGGAT TTGCCAGCGT GGCCCTCCCT CCAGCAGCAC CTGCCCTGGG 
2151 CTGGGCCACG GGAAGCCCTA GGAGCCCCTG GGGACAGACA CACAGCCCCT 
2201 GCCTCTGTAG GAGACTGTCC TGTTCTGTGA GCGCCCCTGT CCTCCCGACC 
2251 TCCATGCCCA CTCGGGGGCA TGCCTAGTCC ATGTGCGTAG GGACAGGCCC 
2301 TCCCTCACCC ATCTACCCCC ACGGCACTAA CCCCTGGCTG CCCTGCCCAG 
2351 CCTCGCACCC GCATGGGGAC ACAACCGACT CCGGGGACAT GCACTCTCGG 
2401 GCCCTGTGGA GGGACTGGTG CAGATGCCCA CACACACAC7 CAGCCCAGAC 
2451 CCGTTCAACA AACCCCGCAC TGAGGTTGGC CGGCCACACG GCCACCACAC 
2501 ACACACGTGC ACGCCTCACA CACGGAGCCT CACCCGGGCG AACTGCACAG 
2551 CACCCAGACC AGAGCAAGGT CCTCGCACAC GTGAACACTC CTCGGACACA 
2601 GGCCCCCACG AGCCCCACGC GGCACCTCAA GGCCCACGAG CCTCTCGGCA 
2 651 GCTTCTCCAC ATGCTGACCT GCTCAGACAA ACCCAGCCCT CCTCTCACAA 
2701 GGGTGCCCCT GCAGCCGCCA CACACACACA GGGGATCACA CACCACGTCA 
2751 CGTCCCTGGC CCTGGCCCAC TTCCCA6TGC CGCCCTTCCC TGCAGGACGG 
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2801 ATCCAGACAT GATAAGATAC ATTGATGAGT TTGGACAAAC CACAAC TAGA 
2851 ATGCAGTGAA AAAAATGCTT TATTTGTGAA ATTTGTGATG CTATTGCTTT 
2901 ATTTGTAACC ATTATAAGCT GCAATAAACA AGTTAACAAC AACAATTGCA 
2951 TTCATTTTAT GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT TTTTTAAAGC 
3001 AAGTAAAACC TCTACAAATG TGGTATGGCT 6ATTATGATC TCTAGTCAAG 
3051 GCACTATACA TCAAATATTC CTTATTAACC CCTTTACAAA TTAAAAAGCT 
3101 AAAGGTACAC AATTTTTGAG CATAGTTATT AATAGCA6AC ACTCTATGCC 
3151 TGTGTGGAGT AAGAAAAAAC AGTATGTTAT GATTATAACT GTTATGCCTA 
3201 CTTATAAAGG TTACA6AATA TPTTTCCATA ATTTTCTTGT ATAGCAGTGC 
3251 AGCTTTTTCC TTTGTGGTGT AAATAGCAAA GCAAGCAAGA GTTCTATTAC 
3301 TAAACACAGC ATGACTCAAA AAACTTAGCA ATTCTGAAGG AAAGTCCTTG 
3351 GGGTCTTCTA CCTTTCTCTT CTTTTTTGGA GGAGTAGAAT GTTGAGAGTC 
3401 AGCAGTAGCC TCATCATCAC TAGATGGCAT TTCTTCTGAG CAAAACAGGT 
3451 TTTCCTCATT AAAGGCATTC CACCACTGCT CCCATTCATC AGTTCCATAG 
3501 GTTGGAATCT AAAATACACA AACAATTAGA ATCAGTAGTT TAACACATTA 
3551 TACACTTAAA AATTTTATAT TTACCTTAGA GCTTTAAATC TCTGTAGGTA 
3601 GTTTGTCCAA TTATGTCACA CCACAGAAGT AAG6TTCCTT CACAAAGATC 
3651 CGGGACCAAA GCGGCCATCG TGCCTCCCCA CTCCTGCAGT TCGGGGGCAT 
3701 GGATGCGCGG ATAGCCGCTG CTGGTTTCCT GGATGCCGAC GGATTTGCAC 
3751 TGCCGGTAGA ACTCCGCGAG GTCGTCCAGC CTCAGGCAGC AGCTGAACCA' 
3801 ACTCGCGAGG GGATCGAGCC CGGGGTGGGC 6AA6AACTCC AGCATGAGAT 
3851 CCCCGCGCTG GAGGATCATC CAGCCGGCGT CCCGGAAAAC GATTCCGAAG 
3901 CCCAACCTTT CATAGAAGGC GGCGGTG6AA TCGAAATCTC GTGATGGCAG 
3951 GTTGGGCGTC GCTTGGTCGG TCATTTCGAA CCCCAGAGTC CCGCTCAGAA 
4001 GAACTC6TCA AGAAGGCGAT AGAAGGCGAT GCGCTGCGAA TCGGGAGCGG 
4051 CGATACCGTA AAGCACGAGG AAGCGGTCAG CCCATTCGCC GCCAAGCTCT 
4101 TCAGCAATAT CACGGGTAGC CAACGCTATG TCCTGATAGC GGTCCGCCAC 
4151 ACCCAGCCGG CCACAGTCGA TGAATCCAGA AAAGCGGCCA TTTTCCACCA 
42*01 TGATATTCGG CAAGCAGGCA TCGCCATGGG TCACGACGAG ATCCTCGCCG 
4251 TCGGGCATGC GCGCCTTGAG CCTGGCGAAC AGTTCGGCTG GCGCGAGCCC 
4301 CTGATGCTCT TCGTCCAGAT CATCCTGATC 6ACAAGACCG GCTTCCATCC 
4351 GAGTACGTGC TCGCTCGATG CGAT6TTTCG CTTGGTGGTC GAATGGGCAG 
4401 GXAGCCGGAT CAAGCGTATG CAGCCGCCGC ATTGCATCAG CCATGATGGA 
4451 TACTTTCTCG GCAGGAGCAA GGTGAGATGA CAGGAGATCC TGCCCCGGCA 
4501 CTTCGCCCAA TAGCAGCCAG TCCCTTCCCG CTTCAGTGAC AACGTC6AGC 
4551 ACAGCTGCGC AAGGAACGCC CGTCGTG6CC AGCCACGATA GCCGCGCTGC 
4 601 CTCGTCCTGC AGTTCATTCA GGGCACCGGA CAGGTCGGTC TTGACAAAAA 
4651 GAACCGGGCG CCCCTGCGCT GACAGCCGGA ACACGGCGGC ATCAGAGCAG 
4701 CCGATTGTCT GTTGTGCCCA GTCATAGCCG AATAGCCTCT CCACCCAAGC 
4751 GGCCGGAGAA CCTGCGTGCA ATCCATCTTG TTCAATCATG C6AAACGATC 
4801 CTCATCCTGT CTCTTGATCA GATCTTGATC CCCTGCGCCA TCAGATCCTT 
4851 GGCGGCAAGA AAGCCATCCA GTTTACTTTG CAGGGCTTCC CAACCTTACC 
4 901 AGAGGGCGCC CCAGCTGGCA ATTCCGGTTC GCTTGCTGTC CATAAAACCG 
4 951 CCCAGTCTAG CTATCGCCAT GTAAGCCCAC TGCAAGCTAC CTGCTTTCTC 
5001 TTTGCGCTTG CGTTTTCCCT TGTCCAGATA GCCCAGTAGC TGACATTCAT 
5051 CCGGGGTCAG CACCGTTTCT GCGGACTGGC TTTCTACGTG TTCCGCTTCC 
5101 TTTAGCAGCC CTTGCGCCCT GAGTGCTTGC GGCAGCGTGA AGCTAGCTTT 
5151 TTGCAAAAGC CTAGGCCTCC AAAAAAGCCT CCTCACTACT TCTGGAATAG 
5201 CTCAGAGGCC 6AGGCGGCCT CGGCCTCTGC ATAAATAAAA AAAATTAGTC 
5251 AGCCATGGGG CGGAGAATGG GGCGGGATGG GCGGAGTTAG GGCGGAACTG 
5301 GGCGGAGTTA GGGGCGGGAC TATGGTTGCT GACTAATTGA 6ATGCATGCT 
5351 TTGCATACTT CTGCCTGCTG GGGAGCCTGG GGACTTTCCA CACCTGGTTG 
5401 CTGACTAATT GAGATGCATG CTTTGCATAC TTCTGCCTGC TGGGGAGCCT 
5451 GGGGACTTTC CACACCCTAA CTGACACACA TTCCACAGCT GCCTCGCGCG 
5501 TTTCGGTGAT GACGGTGAAA ACCTCTGACA CATGCAGCTC CCGGAGACGG 
5551 TCACAGCTTG TCTGTAAGCG GATGCCGGGA GCAGACAAGC CCGTCAGGGC 
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5601 GCGTCAGCGG GTGTTGGCGG GTGTCGGGGC GCAGCCATGA CCCAGTCACG 
5651 TAGCGATAGC GGAGTGTATA CTGGCTTAAC TATGCGGCAT CAGAGCAGAT 
5701 TGTACTGAGA GTGCACCATA TGCGGTGTGA AATACCGCAC AGATGCGTAA 
5751 GGAGAAAATA CCGCATCAGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG 
5801 CTGCGCTCGG TCGTTCGGCT GCGGC6AGCG GTATCAGCTC ACTCAAAGGC 
5851 GGTAATACGG TTATCCACAG AATCAGGG6A TAACGCAGGA AAGAACATCT 
5901 GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 
5951 GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC 6AGCATCACA AAAATCGACG 
6001 CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 
6051 TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CT6TTCCGAC CCTOCCGCTT 
6101 ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA 
6151 TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC 
6201 TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC 
6251 6GTAACTATC GTCTTGA6TC CAACCCGGTA AGACAC6ACT TATC6CCACT 
6301 GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT 6TAGGCGGTG 
6351 CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA 
6401 GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT 
6451 TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT 
6501 TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT 
6551 CCTTTGATCT TTTCTACGGG GTCTGACGCT CA6TGGAACG AAAACTCACG 
6601 TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC 
6651 TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGA6TAA 
6701 ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAfGCAC CTATCTCAGC 
6751 6ATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA 
6801 TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA 
6851 CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC 
6901 AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 
6951 TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 
7001 AATAGTTTGC GCAACGTTGT TGCCATTGCT GCAGGCATCG TGGTGTCACG 
7051 CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC 
7101 GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT 
7151 CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT 
7201 TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 
7251 TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA AtTAGTGTATG 
7301 C6GCGACCGA GTTGCTCTTG CCCGGCGTCA ACACGGGATA ATACCGCGCC 
7351 ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC 
7401 GAAAACTCTC AAGGATCTTA CCGCTGWGA 6ATCCAGTTC GATGTAACCC 
7451 ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC 
7501 TGGGTGAGCA AAAACA6GAA GGCAAAATGC CGC AAAAA AG GGAATAAGGG 
7551 CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA 
7601 AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT 
7651 TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC 
7701 CACCTGACGT CXAAGAAACC ATTATTATCA TGACATTAAC CTATAAAAAT 
7751 AGGCGTATCA CGAGGCCCTT TCGTCTTCAA GAATTCATCG ATAT CGGAAA 
7801 ATGAAAAAAA ATATTTTTTA ATTTTAAAAT 6AAATGTTTA TTTTCAATTT 
7851 CTCCAAATTT CACAAGGAAA GATTAGTCAC GGGTATGGGA 6AGCAGAGGA 
7901 CCATAAGAGT TCAGGAATAG AATCCATTAT GATTCTGGAG TCAAGGAA6T 
7951 ACTGATGCCA AGGTTTCAGT ATAAGAGCAG TATCCACTGG AAAGGATAAA 
8001 GTCACTACAA CTGAGCACAG AGCAGGACAG CTACCTAATG A6TGGTCACT 
8051 AATGGGCCAC TGTTACACTG TTATACGGCT TAGGAATGAG CACTGAGGCT 
8101 GTGAGGTGTA TGGGTAAGGA CATCAGGATG TAAACCCAGC TCAGGTAGAG 
8151 GACTCAGAGC ACAGCACAAT CAGCACGAAC TAATAAACAA CAGATAAGAT 
8201 AAGGCACAAG CTCAGCAATA TTGGATCAGG 6ATCTTT6TA AATCTGACTG 
8251 TGTATTCAGT CTAGTTCAAT GTGACTCATG AAGCCCACCC ATATGCAAAT 
8301 CTAGAGAAGA CTTTAGAGTA TAAATCT6AG GCTCACCTCA CATACCAGCA 
8351 AGGGAGTGAC CAGCTTGTCT TAAGGCACCA CTGAGCCCAA GTCTTAGACA 
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8401 TCATGGATTG GCTGTGGAAC TTGCTATTCC TGATGGCAGC TGCCCAAGGT 
8451 AAGTCATCAG AAAAAAGAGT TCCAAGGGAA ATTGAAGCAG TTCCGAGCTC 
8501 GGTACCCTCG A6ATCCTAGA 6AGGTCTGGT GGAGCCTGCA AAAGTCCAGC 
8551 TTTCAAAGGA ACACAGAAGT ATCTGTATGG AATATTAGAA GATG TTGCT T 
8601 TTACTCTTAA GTTGGTTCCT ACGAAAAATA 6TTAAATACT GTGAC TTTAA 
8651 AATGTGAGAG GGTTTTCAAG TACTCATTTT TTTAAATGTC CAAAATTTTT 
8701 GTCAATCAAT TTGAGGTCTT 6TTTGT6TAG AACTGACATT ACTTAAAGTT 
8751 TAACC6AGGA ATGGGAGTGA GGCTCTCTCA TACC CTATCC A6AACTGACT 
8801 TTTAACAATA ATAAATTAAG TTTAAAAIAT TTTTAAATGA XTTGAGCAAT 
8851 GTTGAGTTGA GTCAAGATGG CCGATCAGAA CCAGAACACC TGCAGCAGCT 
8901 GGCAG6AAGC AGGTCATGTG GCAAGGCTAT TPGGGGAAGG GAAAATAAAA 
8951 CCACTAGGTA AACTT6TAGC TGTGGTTT6A AGAAGTGGTT TTGAAACACT 
9001 CTGTCCAGCC CCACCAAACC GAAAGTCCAG GCTGAGCAAA ACACCACCTG 
9051 G6TAATTTGC ATTTCTAAAA TAAGTTGAGG ATTCAGCCGA AACTGGAGAG 
9101 GTCCTCTTTT AACTTATTGA GTTCAACCTT TTAATTTTA6 CTTGAGTAGT 
9151 TCTAGTTTCC CCAAACTTAA GTTXATCGAC TTCTAAAATG TATTTAGAAT 
9201 T 
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